HIV ENVELOPE POLYPEPTIDES AND VACCINE

BACKGROUND OF THE INVENTION

Field of the Invention

This invention relates to HIV envelope
polypeptides and vaccines containing the polypeptides.

Description of the Related Art

Acquired immunodeficiency syndrome (AIDS) is
caused by a retrovirus identified as the human
immunodeficiency virus (HIV). There have been intense
efforts to develop a vaccine that induces a protective
immune response based on induction of antibodies or
cellular responses. Recent efforts have used subunit
vaccines where an HIV protein, rather than attenuated
or killed virus, is used as the immunogen in the
vaccine for safety reasons. Subunit vaccines generally
include gp120, the portion of the HIV envelope protein
which is on the surface of the virus.
The HIV envelope protein has been extensively
described, and the amino acid and nucleic acid
sequences encoding HIV envelope from a number of HIV
strains are known (Myers, G. et al., 1992. Human
Retroviruses and AIDS. A compilation and analysis of
nucleic acid and amino acid sequences. Los Alamos
National Laboratory, Los Alamos, New Mexico). The HIV
envelope protein is a glycoprotein of about 160 kd
(gp160) which is anchored in the membrane bilayer at
its carboxyl terminal region. The N-terminal segment,
gp120, protrudes into the aqueous environment
surrounding the virion and the C-terminal segment,
gp41, spans the membrane. Via a host-cell mediated
process, gp160 is cleaved to form gp120 and the
integral membrane protein gp41. As there is no
covalent attachment between gp120 and gp41, free gp120
is sometimes released from the surface of virions and
infected cells.

The gp120 molecule consists of a polypeptide core
of 60,000 daltons which is extensively modified by
N-linked glycosylation to increase the apparent
molecular weight of the molecule to 120,000 daltons.
The amino acid sequence of gp120 contains five
relatively conserved domains interspersed with five
hypervariable domains. The positions of the 18
cysteine residues in the gp120 primary sequence, and
the positions of 13 of the approximately 24 N-linked
glycosylation sites in the gp120 sequence are common to
all gp120 sequences. The hypervariable domains contain
extensive amino acid substitutions, insertions and
deletions. Sequence variations in these domains result
in up to 30% overall sequence variability between gp120
molecules from the various viral isolates. Despite
this variation, all gp120 sequences preserve the
ability of the virus to bind to the viral receptor CD4
and to interact with gp41 to induce fusion of the viral
and host cell membranes.
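
As an illustration of how potential N-linked glycosylation sites can be located in a gp120 amino acid sequence, the following minimal Python sketch scans for the commonly used Asn-X-Ser/Thr sequon (X being any residue except Pro). The sequon rule and the function name `find_sequons` are assumptions introduced here for illustration and are not part of this specification; the test fragment is residues 130-142 of clone C6.1 as printed in Table 1, written in one-letter code.

```python
import re

# Assumed sequon rule: Asn, then any residue except Pro, then Ser or Thr.
# A lookahead is used so that overlapping sequons are all reported.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(protein: str) -> list[int]:
    """Return 1-based positions of Asn residues that begin an N-X-S/T sequon."""
    return [m.start() + 1 for m in SEQUON.finditer(protein.upper())]


# Residues 130-142 of clone C6.1 (Table 1), one-letter code:
# Lys Glu Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr
fragment = "KEGEIKNCSFNIT"
print(find_sequons(fragment))  # positions are relative to the fragment
```

A sequon marks only a potential site; whether a given Asn is actually glycosylated is not predicted by this scan.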

gp120 has been the object of intensive
investigation as a vaccine candidate for subunit
vaccines, as the viral protein which is most likely to
be accessible to immune attack. At present, clinical
trials using gp120 MN strain are underway. However, to
date no human vaccine trial has been of sufficient size
to confirm or refute vaccine efficacy.

The development of candidate HIV-1 vaccines is
burdened by the lack of in vivo or in vitro models of
HIV-1 infection that accurately approximate the
conditions of natural infection in humans. Several
candidate HIV-1 vaccines [Berman et al.; J. Virol.
7:4464-9 (1992); Haigwood et al.; J. Virol. 66:172-82
(1992); Salmon-Ceron et al.; AIDS Res. and Human
Retroviruses 11:1479-86 (1995)] have been described
that elicit broadly cross-reactive antibodies able to
neutralize a variety of diverse HIV-1 isolates in
vitro. However, the relevance of in vitro assays to
protective immunity in vivo is uncertain. Although
several vaccines have provided chimpanzees with
protection from challenge by homologous and
heterologous strains of HIV-1, protection has not
always correlated with in vitro neutralization assays
carried out in T cell lines, or in lectin- and
cytokine-activated peripheral blood mononuclear cells
(PBMCs) [Berman et al.; Nature 345:622-5 (1990); Bruck
et al.; Vaccine 12(12):1141-8 (1994); El-Amad et al.;
AIDS 9:1313-22 (1995); Girard et al.; J. Virol.
69:6239-48 (1995); and Fultz et al.; Science
256:1687-1690 (1992)]. While successful protection of
chimpanzees is encouraging and has historically proved
to be a reliable indicator of vaccine efficacy, the
conditions of infection in all experimental models of
HIV-1 infection differ significantly from natural
infection in humans.
Experimental HIV-1 infection in vivo and in vitro
both suffer from the limitation that the in vitro
amplification of HIV-1, which is required to prepare
virus stocks for in vitro or in vivo infectivity
experiments, imposes a genetic selection that results
in a spectrum of virus quasi-species that differ from
the spectrum of variants present in the clinical
specimens used to establish the culture [Kusumi et al.;
J. Virol. 66:875 (1992); Meyerhans et al.; Cell
58:901-10 (1989)]. Because of these uncertainties, and
even greater uncertainties related to the amount of
virus transmitted, the site and cell type involved in
initial replication, and the kinetics of virus
dissemination, the ability of currently available in
vitro or in vivo assays to reliably predict vaccine
efficacy is questionable.

One of the candidate HIV-1 vaccines that have
entered human clinical trials is recombinant gp120
prepared in Chinese hamster ovary (CHO) cells from the
MN strain of HIV-1 (MN-rgp120) (Berman et al.; J.
Virol. 7:4464-9 (1992)). To date, approximately 499
adults have participated in Phase 1 and 2
immunogenicity and safety trials of this vaccine. The
data collected thus far suggest that MN-rgp120 is safe,
immunogenic, and elicits high titers of neutralizing
antibodies in greater than 95% of individuals immunized
according to a 0, 1, and 6 month immunization schedule
[Belshe et al.; JAMA 272(6):475-80 (1994); McElrath;
Seminars in Cancer Biol. 6:1-11 (1995)]. However,
during the course of these trials, nine vaccinees who
received MN-rgp120 have become infected with HIV-1
through high risk behavior. Small trials, such as
these, in populations with low rates of infection and
minimally sized placebo control groups do not have
sufficient statistical power to confirm or refute
vaccine efficacy.
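
The point about statistical power can be made concrete with a rough calculation. The sketch below uses a normal approximation to a two-proportion comparison; the function name `approx_power` and every number passed to it (placebo infection rate, vaccine efficacy, group size) are hypothetical values chosen only for illustration and do not describe the trials discussed above.

```python
from math import sqrt
from statistics import NormalDist


def approx_power(p_placebo: float, efficacy: float, n_per_arm: int,
                 alpha: float = 0.05) -> float:
    """Approximate power of a two-sided two-proportion z-test.

    p_placebo  -- assumed infection rate in the placebo group
    efficacy   -- assumed fractional reduction in infection rate among vaccinees
    n_per_arm  -- assumed number of subjects in each group
    """
    p_vaccine = p_placebo * (1.0 - efficacy)
    # Standard error of the difference between the two observed proportions.
    se = sqrt(p_placebo * (1.0 - p_placebo) / n_per_arm
              + p_vaccine * (1.0 - p_vaccine) / n_per_arm)
    z_crit = NormalDist().inv_cdf(1.0 - alpha / 2.0)
    z_effect = (p_placebo - p_vaccine) / se
    # Probability of exceeding the critical value (far tail ignored).
    return NormalDist().cdf(z_effect - z_crit)


# Hypothetical scenario: 2% infection rate, 50% efficacy.
for n in (250, 2500, 10000):
    print(n, round(approx_power(0.02, 0.5, n), 3))
```

Under these assumed inputs, power rises steeply with group size, which is the qualitative point made in the text: a few hundred subjects in a low-incidence population cannot distinguish an effective vaccine from an ineffective one.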
However, effective vaccines based on gp120 or
another HIV protein for protection against additional
strains of HIV are still being sought to prevent the
spread of this disease.

Description of the Background Art

Recombinant subunit vaccines are described in
Berman et al., PCT/US91/02250 (published as number
WO91/15238 on 17 October 1991). See also, e.g.,
Hu et al., Nature 328:721-724 (1987) (vaccinia virus-
HIV envelope recombinant vaccine); Arthur et al.,
J. Virol. 63(12):5046-5053 (1989) (purified gp120);
and Berman et al., Proc. Natl. Acad. Sci. USA
85:5200-5204 (1988) (recombinant envelope glycoprotein
gp120).

Numerous sequences for gp120 are known. The
sequence of gp120 from the IIIB substrain of HIV-1
referred to herein is that determined by Muesing et
al., "Nucleic acid structure and expression of the
human AIDS/lymphadenopathy retrovirus," Nature
313:450-458 (1985). The sequences of gp120 from the
NY-5, Jrcsf, Z6, Z321, and HXB2 strains of HIV-1 are
listed by Myers et al., "Human Retroviruses and AIDS; A
compilation and analysis of nucleic acid and amino acid
sequences," Los Alamos National Laboratory, Los Alamos,
New Mexico (1992). The sequence of the Thai isolate
A244 is provided by McCutchan et al., "Genetic Variants
of HIV-1 in Thailand," AIDS Res. and Human Retroviruses
8:1887-1895 (1992). The MN 1984 clone is described by
Gurgo et al., "Envelope sequences of two new United
States HIV-1 isolates," Virol. 164:531-536 (1988). As
used herein, MN, MN-rgp120, the MN clone or isolate
refers to MN GNE. The MN GNE amino acid sequence is
Sequence ID No. 29.

Each of the above-described references is
incorporated herein by reference in its entirety.



Summary of the Invention

Oligonucleotide sequences encoding gp120
polypeptides from breakthrough isolates of vaccine
trials using MN-rgp120 and the encoded gp120
polypeptides are provided. Use of the gp120
polypeptides from one or more of the isolates in a
subunit vaccine, usually together with MN-rgp120, can
provide protection against HIV strains that are
sufficiently different from the vaccine strain (e.g.,
MN-rgp120) that the vaccine does not confer protection
against those strains. Antibodies induced by the
polypeptides are also provided.

Brief Description of the Drawings

Figure 1 illustrates the kinetics of antibody
response to MN-rgp120 in vaccinees infected with HIV-1.






WO 98/01564 



PCT/US97/09690



Sera were collected at the time points indicated and
assayed for antibodies reactive with MN-rgp120 (open
circles) or a synthetic peptide derived from the V3
domain of MN-rgp120 (closed circles). Arrows indicate
dates of injection. Plus sign indicates the first time
HIV-1 infection was detected. Shaded area indicates
data collected after HIV-1 infection. Data from
vaccinee C6 is shown in panel A; C8 in panel B; C7,
panel C; C11, panel D; C10, panel E; C17, panel F; and
C15, panel G.

Figure 2 illustrates the kinetics of CD4 blocking
antibody response in vaccinees infected with HIV-1.
Sera were collected at the time points indicated and
assayed for antibodies able to block the binding of
[125I]-labeled MN-rgp120 to cell surface CD4. Arrows
indicate dates of injection. Plus sign indicates the
first time HIV-1 infection was detected. Shaded area
indicates data collected after HIV-1 infection. Data
from vaccinee C6 is shown in panel A; C8 in panel B;
C7, panel C; C11, panel D; C10, panel E; C17, panel F;
and C15, panel G.

Figure 3 illustrates predicted amino acid
sequences of envelope glycoproteins (gp120) from
breakthrough viruses. Proviral DNA sequences were
amplified by PCR from PBMCs and cloned into the pRK5
expression plasmid. Two clones from each infected
vaccinee were sequenced from double stranded plasmid
DNA. Sequence numbering is with reference to the
initiator methionine residue of gp120. For the purpose
of comparison, the sequences shown begin at amino
acid 12 of the mature, fully processed envelope
glycoproteins (corresponding to position 41 of the
gp120 open reading frame). Shaded areas indicate
sequences at neutralizing epitopes; dark boxes indicate
polymorphisms thought to be important for the binding
of virus neutralizing MAbs reactive with MN-rgp120.
Conserved (C) regions and variable (V) regions are
indicated above the sequences. Boxes indicate sequence
homologies and polymorphisms.

Figure 4 illustrates immunoprecipitation of
recombinant gp120 prepared from breakthrough viruses.
Recombinant gp120s from the seven breakthrough viruses
were prepared by transient transfection of 293s cells.
Cells were metabolically labeled with 35S-methionine and
growth conditioned cell culture supernatants were
immunoprecipitated with polyclonal antisera to
MN-rgp120. Immunoprecipitates were resolved by
SDS-PAGE and visualized by autoradiography. C8 lanes a
and b correspond to clones C8.3 and C8.6; C6 lanes a
and b correspond to clones C6.1 and C6.5; C7 lanes a
and b correspond to clones C7.2 and C7.10; C17 lanes a
and b correspond to clones C17.1 and C17.3; C11 lanes a
and b correspond to clones C11.5 and C11.7; C10 lanes a
and b correspond to clones C10.5 and C10.7; C15 lanes a
and b correspond to clones C15.2 and C15.3.
Figure 5 illustrates binding of monoclonal
antibodies to recombinant gp120 from breakthrough
viruses. Growth-conditioned cell culture supernatants
were collected from 293s cells transiently transfected
with plasmids directing the expression of breakthrough
virus envelope glycoproteins. The relative rgp120
concentrations were determined by ELISA using MAb 5B6
specific for the HSV-1 glycoprotein D flag epitope at
the amino terminus of all of the rgp120 variants
described herein. The resulting rgp120 preparations
were captured onto wells of microtiter plates coated
with a polyclonal antibody specific for a conserved
sequence in the C-terminus of gp120. The binding of
virus neutralizing monoclonal antibodies reactive with
gp120 was determined by ELISA. A, binding by MAb (5B6)
specific for the HSV-1 glycoprotein D flag epitope; B,
binding by MAb (1034) against the V3 domain of
MN-rgp120; C, binding by MAb (50.1) raised against a
synthetic peptide corresponding to the V3 domain of
MN-rgp120; D, binding by a human MAb (15e) known to
block the binding of gp120 to CD4.
Figure 6 depicts the mature envelope glycoprotein
(gp120) from the MN clone of the MN strain of HIV-1
(SEQ. ID. NO. 29). Hypervariable domains are indicated
in bold, and the V and C regions are indicated
(according to Modrow et al., J. Virology 61(2):570
(1987)). Potential glycosylation sites are marked with

Detailed Description of the Invention

The present invention provides gp120 polypeptides
from breakthrough isolates of HIV vaccine trials.
Novel oligonucleotide sequences encoding gp120 from
breakthrough isolates which can be used to express
gp120 are also provided. Use of gp120 polypeptides
from one or more of the isolates in a subunit vaccine,
usually together with MN-rgp120, can provide protection
against HIV strains that are sufficiently different
from the vaccine strain (e.g., MN-rgp120) that the
vaccine does not confer protection against those
strains.

In one embodiment, the vaccine is based on the use
of the MN-rgp120 polypeptide (Sequence ID No. 29) and
gp120 polypeptides from MN-like viruses that include
neutralizing epitopes that are not present in the
initial vaccine strain and are sufficiently different
from those of the vaccine strain to have been able to
cause HIV-1 infections in MN-rgp120 vaccinated
individuals (i.e., to result in breakthrough
infections). Use of the initial vaccine strain
empirically determines the viruses present in the
population that contain additional neutralizing
epitopes sufficiently different from those of the
vaccine strain to escape protection induced by the
vaccine strain. Use of an initial representative gp120
polypeptide in a vaccine acts as a sieve so that
viruses that are not effectively protected against by
the vaccine strain break through the vaccine,
empirically resulting in determination of additional
strains in a given geographic region that are not
protected against by the initial vaccine strain. Use
of gp120 from those breakthrough isolates complements
the vaccine isolate by providing additional
neutralizing epitopes not present in the initial
vaccine strain, therefore creating a more complete
vaccine that confers protection against multiple
different virus strains in the region.
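
How far two gp120 sequences differ can be quantified by a simple position-by-position comparison. The sketch below is an illustration only: the function names are hypothetical, the sequences are assumed to be pre-aligned and of equal length, and no threshold for "sufficiently different" is implied. The two fragments are residues 91-103 of clones C6.1 and C6.5 as printed in Table 1, in one-letter code.

```python
def differing_positions(ref: str, other: str) -> list[int]:
    """Return 1-based positions at which two pre-aligned sequences differ."""
    if len(ref) != len(other):
        raise ValueError("sequences must be aligned to the same length")
    return [i + 1 for i, (a, b) in enumerate(zip(ref, other)) if a != b]


def percent_variability(ref: str, other: str) -> float:
    """Percentage of aligned positions that differ between two sequences."""
    return 100.0 * len(differing_positions(ref, other)) / len(ref)


# Residues 91-103 of Table 1, one-letter code:
c6_1 = "CTNWKKNDTKTNS"  # clone C6.1: ... Trp Lys Lys Asn ...
c6_5 = "CTNWKENDTKTNS"  # clone C6.5: ... Trp Lys Glu Asn ...

print(differing_positions(c6_1, c6_5))            # fragment-relative positions
print(round(percent_variability(c6_1, c6_5), 1))  # percent of positions differing
```

A comparison of this kind counts primary-sequence differences only; it does not capture conformation-dependent epitopes, which is the limitation of homology-based selection that the empirical approach is intended to address.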
Prior HIV-1 vaccine strategies were based on
selection of appropriate candidate vaccine polypeptides
based on homology alignment studies. However, since
some of the neutralizing epitopes are conformation-
dependent and the location of all of these epitopes is
not known, this approach necessarily cannot determine
all of the neutralizing epitopes that should be
included in a vaccine for a particular region. In
contrast, the present approach uses a selected
representative strain and empirically determines
strains that are sufficiently different and therefore
break through the barrier of protection provided by the
initial vaccination program. Those strains can be
included in the vaccine to confer more complete
protection from HIV strains in the region. In
addition, those strains can be used alone to confer
protection against the breakthrough virus.

In another embodiment, the invention comprises a
vaccine containing a first HIV gp120 polypeptide
sequence and a breakthrough isolate HIV gp120
polypeptide sequence from a vaccinee vaccinated with a
vaccine including the first HIV gp120 polypeptide
sequence, the HIV gp120 polypeptide sequences being in
a suitable carrier. Fragments of one or both HIV gp120
polypeptide sequences can be substituted for one or
both of the corresponding HIV gp120 polypeptide
sequences.

Preferably, the first gp120 polypeptide sequence
contains neutralizing epitopes found in one or more
gp120 polypeptides present in isolates from the
geographical region where the initial vaccine (i.e.,
the vaccine that gives rise to the breakthrough
isolate) is administered. More preferably, the first
gp120 polypeptide sequence contains at least one of the
more common neutralizing epitopes for this region, and
most preferably the first gp120 polypeptide sequence
contains at least one of the three most common
neutralizing epitopes.

gp120 polypeptide sequences suitable for use as
the first gp120 polypeptide sequence include gp120 MN,
the Thai isolate A244 sequence (hereinafter "gp120
A244"), gp120 MN-GNE6 (Sequence ID No. 31; also known
in the art as "gp120 GNE6"), and gp120 MN-GNE8
(Sequence ID No. 33; also known in the art as "gp120
GNE8"), and the like. gp120 MN, gp120 MN-GNE6, and
gp120 MN-GNE8 are especially preferred for use as the
first gp120 polypeptide sequence in initial vaccines
for North America. gp120 A244 is especially preferred
for use as the first gp120 polypeptide sequence in
initial vaccines for Thailand.

In a variation of this embodiment, the vaccine
includes two different (i.e., first and second) gp120
polypeptide sequences, or fragments thereof, in
combination with a breakthrough isolate HIV gp120
polypeptide sequence. The latter can be from a
vaccinee vaccinated with either or both of the first
and second HIV gp120 polypeptide sequences.

Exemplary vaccines include those containing
combinations of gp120 MN, gp120 A244, gp120 MN-GNE6
(Sequence ID No. 31), and gp120 MN-GNE8 (Sequence ID
No. 33). Combinations of gp120 MN and gp120 A244 or
gp120 MN-GNE8 (Sequence ID No. 33) with a breakthrough
isolate HIV gp120 polypeptide sequence are especially
preferred.

In vaccines containing gp120 MN, the breakthrough
isolate HIV gp120 polypeptide sequence can be an HIV
gp120 polypeptide sequence selected from the group
consisting of Sequence ID Nos. 2, 4, 6, 8, 10, 12, 14,
16, 18, 20, 22, 24, 26, and 28, and fragments thereof.

The term "subunit vaccine" is used herein, as in
the art, to refer to a viral vaccine that does not
contain virus, but rather contains one or more viral
proteins or fragments of viral proteins. As used
herein, the term "multivalent" means that the vaccine
contains gp120 from at least two HIV isolates having
different amino acid sequences.

The term "breakthrough isolate" or "breakthrough
virus" is used herein, as in the art, to refer to a
virus isolated from a vaccinee.

The terms "amino acid sequence", "polypeptide
sequence", and "polypeptide" are used interchangeably
herein, as in the art, as are the terms "nucleic acid
sequence", "nucleotide sequence", and
"oligonucleotide".



Polypeptides from Breakthrough Isolates

The gp120 polypeptides of this invention
correspond to the amino acid sequences of seven
breakthrough isolates which are illustrated below in
Table 1. A polypeptide of this invention includes an
HIV gp120 amino acid sequence illustrated in Table 1
(Sequence ID Nos. 1, 3, 5, 7, 9, 11, 13, 15, 17, 19,
21, 23, 25, and 27) and fragments thereof. The
polypeptides of this invention can include fused
sequences from two or more HIV gp120 or gp160 amino
acid sequences.

The polypeptide can also be joined to another
viral protein, such as a flag epitope amino acid
sequence. The term "flag epitope" is used herein, as
in the art, to denote an amino acid sequence that
includes an epitope recognized by a monoclonal
antibody. Flag epitopes facilitate using single
monoclonal antibody affinity purification of a
plurality of different recombinant proteins, each
having the flag epitope recognized by the monoclonal
antibody. Numerous amino acid sequences can function
as flag epitopes. The N-terminal sequence of Herpes
Simplex Virus Type 1 (HSV-1) glycoprotein D (gD-1) is
conveniently used as the flag epitope and its use is
described in detail in the examples. The flag epitope
is conveniently fused to the N terminus of the HIV
gp120 polypeptide sequence. Alternatively, however,
monoclonal antibodies that recognize neutralizing
epitopes in the rgp120 sequences can be used to
affinity purify the amino acid sequences, and a flag
epitope can be omitted.

In addition, various signal sequences can be
joined to a polypeptide of this invention. Although
rgp120 is secreted to some extent in HIV cultures, the
amount of the envelope glycoprotein released from
(secreted by) the host cells varies widely from strain
to strain. Various signal sequences can be introduced
into the polypeptide by joining a nucleotide sequence
encoding the signal sequence to the nucleotide sequence
encoding the rgp120 to facilitate secretion of rgp120
from the cells. For example, Chiron HIV gp120
polypeptides include a signal sequence from tissue
plasminogen activator (TPA) that provides good
secretion of rgp120. Additional signal sequences are
well known and include the N-terminal domain of murine
leukemia virus surface protein gp70 described by Kayman
et al., J. Virol. 68:400-410 (1984).

Table 1 illustrates the nucleotide and deduced
amino acid sequences for two clones of each of the seven
breakthrough isolates of this invention. The clones
are: C6.1; C6.5; C8.3; C8.6; C15.2; C15.3; C7.2;
C7.10; C11.5; C11.7; C10.5; C10.7; C17.1; and C17.3.
These sequences are SEQ. ID. NOs. 1-28, the first
sequence number for each clone being the nucleotide
sequence and the second being the amino acid sequence.
The amino acid sequence for MN and the nucleotide and
deduced amino acid sequences for MN-GNE6 and MN-GNE8
are illustrated in the sequence listing hereinafter.
In the listing for MN-GNE6, a stop codon appears at
amino acid residue position 51. This stop codon can be
replaced with a codon encoding the corresponding amino
acid from MN or MN-GNE8 or another isolate.
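
The relationship between the nucleotide rows and the deduced amino acid rows of Table 1 can be checked by translating each codon with the standard genetic code. The following is a minimal sketch under that assumption; the names `CODON_TABLE` and `translate` are introduced here for illustration, stop codons are written as "*", and any codon containing an ambiguous base is written as "X" (Table 1 itself prints Xaa in such positions). The example input is the first nucleotide row of clone C6.1 as printed in Table 1.

```python
from itertools import product

# Standard genetic code, bases ordered T, C, A, G (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence read in frame from its first base."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    # Codons not in the table (e.g. containing N) are reported as 'X'.
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X")
                   for i in range(0, len(seq), 3))


# First row of clone C6.1 in Table 1 (nucleotides 1-36).
row = "GGG GTA CCT GTG TGG AAG GAA GCA ACC ACC ACT CTA"
print(translate(row))  # one-letter code for Gly Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu
```

A check of this kind is also a convenient way to spot transcription errors in a printed sequence, since a codon that does not translate to the residue printed beneath it indicates that one of the two rows has been altered.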






TABLE 1

CLONE C6.1
GGG GTA CCT GTG TGG AAG GAA GCA ACC ACC ACT CTA   36
Gly Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu
  1               5                  10

TTT TGT GCA TCA GAT GCT AAA GCA TAT GAC ACA GAG GTG   75
Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val
         15                  20                  25

CAT AAT GTT TGG GCC ACA CAT GCT TGT GTA CCC ACA GAC   114
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp
                 30                  35

CCA AAC CCA CAA GAA ATG GTA TTG GAA AAT GTG ACA GAA   153
Pro Asn Pro Gln Glu Met Val Leu Glu Asn Val Thr Glu
     40                  45                  50

GAT TTT AAC ATG TGG AAA AAT GAC ATG GTA GAA CAG ATG   192
Asp Phe Asn Met Trp Lys Asn Asp Met Val Glu Gln Met
             55                  60

CAT GAG GAT ATA ATC AGT TTA TGG GAT CAA AGC CTA AAA   231
His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys
 65                  70                  75

CCA TGT GTA AAA TTA ACC CCA CTC TGT ATT ACT TTA AAT   270
Pro Cys Val Lys Leu Thr Pro Leu Cys Ile Thr Leu Asn
         80                  85                  90

TGC ACC AAT TGG AAG AAG AAT GAT ACT AAA ACT AAT AGT   309
Cys Thr Asn Trp Lys Lys Asn Asp Thr Lys Thr Asn Ser
                 95                 100

AGT AGT ACT ACA ACT AAT AAT AGT AGT GCT ACA GCT AAT   348
Ser Ser Thr Thr Thr Asn Asn Ser Ser Ala Thr Ala Asn
    105                 110                 115

AGT AGT AGT ACT ACA ACT AAT AGT AGT TGG GGA GAG ATA   387
Ser Ser Ser Thr Thr Thr Asn Ser Ser Trp Gly Glu Ile
            120                 125

AAG GAG GGA GAA ATA AAG AAC TGC TCT TTC AAT ATC ACC   426
Lys Glu Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr
130                 135                 140

ACA AGC ATA AGA GAC AAG GTG AAG AAA GAA TAT GCA CTT   465
Thr Ser Ile Arg Asp Lys Val Lys Lys Glu Tyr Ala Leu
        145                 150                 155

TTT TAT AGC CTT GAT GTA GTA CCA ATA GAA AAT GAT AAT   504
Phe Tyr Ser Leu Asp Val Val Pro Ile Glu Asn Asp Asn
                160                 165

ACT AGC TAT AGG TTG AGA AGT TGT AAC ACC TCA GTC ATT   543
Thr Ser Tyr Arg Leu Arg Ser Cys Asn Thr Ser Val Ile
    170                 175                 180

ACA CAA GCC TGT CCA AAG GTA ACT TTT GAG CCA ATT CCC   582
Thr Gln Ala Cys Pro Lys Val Thr Phe Glu Pro Ile Pro
            185                 190

ATA CAT TAT TGT ACC CCG GCT GGT TTT GCG ATT CTG AAG   621
Ile His Tyr Cys Thr Pro Ala Gly Phe Ala Ile Leu Lys
195                 200                 205

TGT AGA GAT AAA AAG TTC AAT GGA ACA GGA CCA TGC AAA   660
Cys Arg Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys
        210                 215                 220

AAT GTT AGC ACA GTA CAA TGT GCA CAT GGA ATT AAG CCA   699
Asn Val Ser Thr Val Gln Cys Ala His Gly Ile Lys Pro
                225                 230

GTA GTG TCA ACT CAA CTG CTG TTA AAT GGC AGC CTA GCA   738
Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala
    235                 240                 245

GAA GAA GAG GTA ATA ATT AGA TCT GCC AAT TTC TCA AAC   777
Glu Glu Glu Val Ile Ile Arg Ser Ala Asn Phe Ser Asn
            250                 255

AAT GCT AAA ATC ATA ATA GTA CAG TTG AGG ... ... ...   816
Asn Ala Lys Ile Ile Ile Val Gln Leu Arg ... ... ...
260                 265                 270

GAA ATT AAT TGT ACA AGA CCC AGC AAC AAT ACA ATA AAA   855
Glu Ile Asn Cys Thr Arg Pro Ser Asn Asn Thr Ile Lys
        275                 280                 285

GGT ATA CAC ATA GGA CCA GGG AGA GCA TTT TAT GCA ACA   894
Gly Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr
                290                 295

GGA GAC ATA CGA GGA GAT ATA AGA CAA GCA CAT TGT AAC   933
Gly Asp Ile Arg Gly Asp Ile Arg Gln Ala His Cys Asn
    300                 305                 310

ATT AGT GGA GCA AAA TGG AAT AAC ACT TTA ... ... ...   972
Ile Ser Gly Ala Lys Trp Asn Asn Thr Leu ... ... ...
            315                 320

GTT AAA AAA TTA AAA GAA CAA TTT CCA AAT ... ... ...   1011
Val Lys Lys Leu Lys Glu Gln Phe Pro Asn ... ... ...
325                 330                 335

GTC TTT AAC CAT TCC TCA GGA GGG GAC CCA GAA ATT GTA   1050
Val Phe Asn His Ser Ser Gly Gly Asp Pro Glu Ile Val
        340                 345                 350

ATG CAC AGT TTT AAT TGT CAA GGG GAA TTT TTC TAC TGT   1089
Met His Ser Phe Asn Cys Gln Gly Glu Phe Phe Tyr Cys
                355                 360

AAT ACA ACA AAG CTG TTT AAT AGT ACT TGG AAT GAT ACT   1128
Asn Thr Thr Lys Leu Phe Asn Ser Thr Trp Asn Asp Thr
    365                 370                 375

ACA GAG TCA AAT AAC AAT GAT AGT ACT ATT ACA CTC CCA   1167
Thr Glu Ser Asn Asn Asn Asp Ser Thr Ile Thr Leu Pro
            380                 385

TGC AGA ATA AAA CAA ATT ATA AAC ATG TGG CAG GAA ATA   1206
Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Ile
390                 395                 400

GGA AAA GCA ATG TAT GCC CCT CCC ACC AGA GGA GAA ATT   1245
Gly Lys Ala Met Tyr Ala Pro Pro Thr Arg Gly Glu Ile
        405                 410                 415

AAA TGT TCA TCA AAT ATT ACA GGA CTA CTG TTA ATA AGA   1284
Lys Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Ile Arg
                420                 425

GAT GGT GGT ATT AAC ACT AGC GAT GCC ACC GAG ACC TTC   1323
Asp Gly Gly Ile Asn Thr Ser Asp Ala Thr Glu Thr Phe
    430                 435                 440

AGA CCG GGA GGA GGA GAT ATG AGG GAC AAT TGG AGA AGT   1362
Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser
            445                 450

GAA TTA TAT AAA TAT AAA GTA GTG AAA ATT GAG CCA TTA   1401
Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu
455                 460                 465

GGA GTA GCA CCC ACC AAG GCA AAG AGA AGA GTG GTG CAG   1440
Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln
        470                 475                 480

AGA GAA AAA AGA GCA GTA ACA CTA GGA GCT ATG TTC CTT   1479
Arg Glu Lys Arg Ala Val Thr Leu Gly Ala Met Phe Leu
                485                 490

GGG TTC TTA GGA GCA TAA AGC TTC   1503
Gly Phe Leu Gly Ala Xaa Ser Phe
    495                 500 501

CLONE C6.5
GGG GTA CCT GTA TGG AAA GAA GCA ACC ACC ACT CTA   36
Gly Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu
  1               5                  10

TTT TGT GCA TCA GAT GCT AAA GCA TAT GAC ACA GAG GTG   75
Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val
         15                  20                  25

CAT AAT GTT TGG GCC ACA CAT GCT TGT GTA CCC ACA GAC   114
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp
                 30                  35

CCA AAC CCA CAA GAA ATG GTA TTG GAA AAT GTG ACA GAA   153
Pro Asn Pro Gln Glu Met Val Leu Glu Asn Val Thr Glu
     40                  45                  50

GAT TTT AAC ATG TGG AAA AAT GAC ATG GTA GAA CAG ATG   192
Asp Phe Asn Met Trp Lys Asn Asp Met Val Glu Gln Met
             55                  60

CAT GAG ANT ATA ATC AGT TTA TGG GAT CAA AGC CTA AAA   231
His Glu Xaa Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys
 65                  70                  75

CCA TGT GTA AAA TTA ACC CCA CTC TGT ATT ACT TTA AAT   270
Pro Cys Val Lys Leu Thr Pro Leu Cys Ile Thr Leu Asn
         80                  85                  90

TGC ACC AAT TGG AAG GAG AAT GAT ACT AAA ACT AAT AGT   309
Cys Thr Asn Trp Lys Glu Asn Asp Thr Lys Thr Asn Ser
                 95                 100

AGT AGT ACT ACA ACT AAT AAT AGT AGT GCT ACA GCT AAT   348
Ser Ser Thr Thr Thr Asn Asn Ser Ser Ala Thr Ala Asn
    105                 110                 115

AGT AGT AGT ACT ACA ACT AAT AGT AGT TGG GGA GAG ATA   387
Ser Ser Ser Thr Thr Thr Asn Ser Ser Trp Gly Glu Ile
            120                 125

AAG GAG GGA GAA ATA AAG AAC TGC TCT TTC AAT ATC ACC   426
Lys Glu Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr
130                 135                 140

ACA GGC ATA AGA GAC AAG GTG AAG AAA GAA TAT GCA CTT   465
Thr Gly Ile Arg Asp Lys Val Lys Lys Glu Tyr Ala Leu
        145                 150                 155

TTT TAT AGC CTT GAT GTA GTA CCA ATA GAA AAT GAT AAT   504
Phe Tyr Ser Leu Asp Val Val Pro Ile Glu Asn Asp Asn
                160                 165

ACT AGC TAT AGG TTG AGA AGT TGT AAC ACC TCA GTC ATT   543
Thr Ser Tyr Arg Leu Arg Ser Cys Asn Thr Ser Val Ile
    170                 175                 180

ACA CAA GCC TGT CCA AAG GTA ACT TTT GAG CCA ATT CCC   582
Thr Gln Ala Cys Pro Lys Val Thr Phe Glu Pro Ile Pro
            185                 190

ATA CAT TAT TGT ACC CCG GCT GGT TTT GCG ATT CTG AAG   621
Ile His Tyr Cys Thr Pro Ala Gly Phe Ala Ile Leu Lys
195                 200                 205

TGT AAA GAT AAA AAG TTC AAT GGA ACA GGA CCA TGC AAA   660
Cys Lys Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys
        210                 215                 220

AAT GTT AGC ACA GTA CAA TGT ACA CAT GGA ATT AAG CCA   699
Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro
                225                 230

GTA 


GTG 


TCA 


ACT 


CAA 


CTG 


CTG 


TTA 


AAT 


GGC 


AGC 


CTA 


GCA 


738 


50 


Val 


Val 
235 


Ser 


Thr 


Gin 


Leu 


Leu 
240 


Leu 


A9n 


Gly 


Ser 


Leu 
24 5 


Ala 






GAA 


GAA 


GAG 


GTA 


ATA 


ATT 


AGA 


TCT 


GCC 


AAT 


TTC 


TCA 


AAC 


77 7 




Glu 


Glu 


Glu 


Val 
250 


lie 


lie 


Arg 


Ser 


Ala 
255 


Asn 


Phe 


Ser 


Asn 




55 


AAT 


GCT 


AAA 


ATC 


ATA 


ATA 


GTA 


CAG 


TTG 


AAG 


GAA 


CCT 


GTA 


816 




Asn 


Ala Lys 


lie 


Tie. 


lie 


val 


Gin 


Leu 


Lys 


Glu 


Pro 


Val. 






260 










26:5 










27 










GAA 


ATT 


AAT 


TGT 


ACA 


AGA 


CCC 


AGC 


AAC 


AAT 


ACA 


ATA 


AAA 


855 




Glu 


lie Asn 


Cys 


Thr 


Arg 


Pro 


Ser 


Asn 


Asn 


Thr 


lie 


Lys 




60 






275 






280 










285 






GGT 


ATA 


CAC 


ATA 


GGA 


CCA 


GGG 


AGA 


GCA 


TTT 


TAT 


GCA 


ACA 


894 




Gly 


lie His 


lie 


Gly 


Pro 


Gly 


/Arg 


Ala 


Phe 


Tyr 


Ala 


Thr. 





' :'2 90 •' / ' ' 295 
GGA GAC ATA CCA GGA GAT ATA AGA CAA GCA GAT TGT AAC 933
Gly Asp Ile Arg Gly Asp Ile Arg Gln Ala His Cys Asn

ATT AGT GG^ AAG GTA 972
Ile Ser Gly Ala Lys Trp Asn Asn Thr Leu Lys Lys Val
315 320

GTT ATA AAA TTA AAA GAA CAA TTT CCA AAT AAA ACA ATA 1011
Val Ile Lys Leu Lys Glu Gln Phe Pro Asn Lys Thr Ile
325 330 335

GTC TTT AAC CAT TCC TCA GGA GGG GAC CCA GAA ATT GTA 1050
Val Phe Asn His Ser Ser Gly Gly Asp Pro Glu Ile Val
340 345 350

ATG CAC AGT TTT AAT TGT CAA GGG GAA TTT TTC TAC TGT 1089
Met His Ser Phe Asn Cys Gln Gly Glu Phe Phe Tyr Cys
355 360

AAT ACA ACG AAG CTG TTT AAT AGT ACT TGG AAT GAT ACT 1128
Asn Thr Thr Lys Leu Phe Asn Ser Thr Trp Asn Asp Thr
365 370 375

ACA GAG TCA AAT AAC AAT GAT AGT ACT ATT ACA CTC CCA 1167
Thr Glu Ser Asn Asn Asn Asp Ser Thr Ile Thr Leu Pro
380 385

TGG AGA ATA AAA CAA ATT ATA AAC ATG TGG CAG GAA GTA 1206
Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val
390 395 400

GGA AAA GCA ATG TAT GCC CCT CCC ATC AGA GGA GAA ATT 1245
Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Glu Ile
405 410 415

AAA TGT TCA TCA AAT ATT ACA GGA CTA CTG TTA ACA AGA 1284
Lys Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg
420 425

GAT CGT GCT ATT AAC ACT AGC GAT GCC ACC GAG ACC TTC 1323
Asp Gly Gly Ile Asn Thr Ser Asp Ala Thr Glu Thr Phe
430 435 440

AGA CCG GGA GGA GGA GAT ATG AGG GAC AAT TGG AGA ACT 1362
Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser
445 450

GAA TTA TAT AAA TAT AAA GTA GTG AAA ATT GAG CCA TTA 1401
Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu
455 460 465

GGA GTA GCA CCC ACC AAG GCA AAG AGA AGA GTG GTG CAG 1440
Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln
470 475 480

AAA AGA GCA GTA ACA CTA GGA GCT ATG TTC CTT 1479
Arg Glu Lys Arg Ala Val Thr Leu Gly Ala Met Phe Leu
485 490

GGG TTC TTG GGA GCA TAA AGC TTC 1503
Gly Phe Leu Gly Ala Xaa Ser Phe
495 500 501

CLONE C8.3

G GTA CCT GTA TGG AAA GAA GCA ACC ACC ACT CTA TTT 37
Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe
1 5 10

TGT GCA TCA GAT GCT AAA GCA TAT GAT ACA GAG GTA CAT 76
Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val His
15 20 25

AAT GTT TGG GCT ACA CAT GCC TGT GTA CCC ACA GAC CCC 115
Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro
30 35

AAC CCA CAA GAA GTA GTA TTG GAA AAT GTA AGA GAA AAT 154
Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn
40 45 50

TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAG ATG CAT 193
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His
55 60

GAG GAT ATA ATC AGT TTA TGG GAT CAA AGT CTA AAG CCA 232
Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro
65 70 75

TGT GTA AAA TTA ACC CCA CTC TGT GTT ACT TTA AAT TGC 271
Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys
80 85 90

ACT AAT TTG GAG AAT GCT AAT AAT ACC GAG AAT GCT AAT 310
Thr Asn Leu Glu Asn Ala Asn Asn Thr Glu Asn Ala Asn
95 100

AAT ACC AAT AAT TAT ACC TTG GGG ATG GAG AGA GGT GAA 349
Asn Thr Asn Asn Tyr Thr Leu Gly Met Glu Arg Gly Glu
105 110 115

ATA AAA AAC TGC TCT TTC AAT ATC ACC ACA AGC TTA AGA 388
Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Leu Arg
120 125

GAT AAG GTG AAA AAA GAA TAT GCA TTG TTT TAT AAA CTT 427
Asp Lys Val Lys Lys Glu Tyr Ala Leu Phe Tyr Lys Leu
130 135 140

GAT GTA GTA CAA ATA GAT AAT AGT ACC AAC TAT AGG CTG 466
Asp Val Val Gln Ile Asp Asn Ser Thr Asn Tyr Arg Leu
145 150 155

ATA ACT TGT AAT ACC TCA GTC ATT ACA CAG GCC TGT CCA 505
Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro
160 165

AAG GTA TCC TTT GAG CTA ATT CCC ATA CAT TAT TGT GCC 544
Lys Val Ser Phe Glu Leu Ile Pro Ile His Tyr Cys Ala
170 175 180

CCG GCT GGT TTT GCG ATT CTA AAG TGT AAA GAT AAG AAG 583
Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys
185 190

TTC AAT GGA ACA GGA CCA TGT AAA AAT GTC AGC ACA GTA 622
Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val
195 200 205

CAA TGT ACA CAT GGA ATT AGA CCA GTA GTA TCA ACT CAA 661
Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln
210 215 220

CTA CTG TTA AAT GGC AGT CTA GCA GAA GAA GAG ATA GTA 700
Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Ile Val
225 230

ATT AGA TCT GAA AAT ATC ACA GAC AAT GCT AAA ACC ATA 739
Ile Arg Ser Glu Asn Ile Thr Asp Asn Ala Lys Thr Ile
235 240 245

ATA GTG

ACC GGG AAT CGT ACT GAA GAG TTA AAT ATT ACA CTC CCA 1129

TGC AGA ATA AAA CAA ATC ATA AAC TTG TGG CAG GAA GTA 1168

GGC AAA GCA ATG TAT GCC CCT CCC ATC AGA GGA CAA ATT 1207
Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile

AGA TGT TCA TCA AAT ATT ACA GGG CTA CTA TTA ACA AGA 1246
Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg
405 410 415

GAT GGT GGA AGT AAC ACC GGT GAC AAC AGG ACT GAG ACC 1285

Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys
80 85 90

ACT AAT TTG GAG AAT GCT AAT AAT ACC GAG AAT GCT AAT 310
Thr Asn Leu Glu Asn Ala Asn Asn Thr Glu Asn Ala Asn
95 100

Gln Arg Glu Lys
480

ATG TTC CTT GGG TTC 1479
Met Phe Leu Gly Phe
490

CGA CCT GCA 1512
Arg Pro Ala
504

CLONE C15.3

CTC GAG GTA CCT GTG TGG AAA GAA
Leu Glu Val Pro Val Trp Lys Glu
1 5

CTA TTT TGT GCA TCA GAT GCT AAA GCA
Leu Phe Cys Ala Ser Asp Ala Lys Ala
15 20

AAA CAT AAT GTT TGG GGC ACA CAC GCC
Lys His Asn Val Trp Ala Thr His Ala
30

GAT CCC AAC CCA CAA GAA GTA GTA TTG
Asp Pro Asn Pro Gln Glu Val Val Leu
40 45

GAA AAT TTT AAC ATG TGG AAA AAT AAC
Glu Asn Phe Asn Met Trp Lys Asn Asn
55 60

ATG CAT GAA GAT ATA ATC AGT TTA TGG

36
Ala Thr Thr Thr
10
75
25
Val Pro Thr
114
35
50
65
70

AAG CCA TGT GTA AAA TTA ACC CCA CTC TGT
Lys Pro Cys Val Lys Leu Thr Pro Leu Cys
80 85

AAT TGC ACT GAT GAT TTA GGG AAT GCT ACT
Asn Cys Thr Asp Asp Leu Gly Asn Ala Thr
95 100

AGC AGT GCC ACT ACC AAT AGT AGT AGT TGG
Ser Ser Ala Thr Thr Asn Ser Ser Ser Trp
105 110

AAG GGG GAA ATG AAA AGG TGC TCT TTC AAT
Lys Gly Glu Met Lys Arg Cys Ser Phe Asn
120 125

AGC ATA AGA GAT AAG ATT AAG AAA GAA CAT
Ser Ile Arg Asp Lys Ile Lys Lys Glu His
130 135

TAT AGA CTT GAT GTA GTA CCA ATA GAT AAT
Tyr Arg Leu Asp Val Val Pro Ile Asp Asn
145 150

ACA TAT AGG TTG ATA AAT TGT AAT ACC TCA
Thr Tyr Arg Leu Ile Asn Cys Asn Thr Ser
160 165

CAG GCC TGT CCA AAG GTA TCA TTT GAG CCA
Gln Ala Cys Pro Lys Val Ser Phe Glu Pro
170 175

CAT TTT TGT GCC CCG GCT GGT TTT GCG ATT
His Phe Cys Ala Pro Ala Gly Phe Ala Ile
185 190

75 90 115 140 155 180

AAT AAT AAG ACG TTG GAG GGA AAA GGA CCA TGT AAA AAT 621
Asn Asn Lys Thr Phe Glu Gly Lys Gly Pro Cys Lys Asn
195 200 205

GTC ACT ACA GTA CAA TGC ACA CAT GGA ATT AGG CCA GTA 660
Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val
210 215 220

GTG TCA ACT CAA CTG CTG TTA AAT GGC AGT CTA GCA GAA 699
Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu
225 230

GAA GAG GTA ATA ATT AGA TCT GGC AAT ATC ACA GAC AAT 738
Glu Glu Val Ile Ile Arg Ser Gly Asn Ile Thr Asp Asn
235 240 245

ACT AAA ACC ATT ATA GTA GAG CTA AAC GAA TCT GTA GTA 777
Thr Lys Thr Ile Ile Val Gln Leu Asn Glu Ser Val Val
250 255

ATT AAT TGT ACA AGA TCC AAC AAC AAT ACA AGA AAA AGT 816
Ile Asn Cys Thr Arg Ser Asn Asn Asn Thr Arg Lys Ser
260 265 270

ATA CAT ATA GGA CCA GGG AGT GCA TTT TTT GCA ACA GGA 855
Ile His Ile Gly Pro Gly Ser Ala Phe Phe Ala Thr Gly
275 280 285

GAA ATA ATA GGA GAT ATA AGA CAA GCA CAC TGT AAC CTT 894
Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Leu
290 295

AGT AGA ACA CAA TGG AAT AAC ACT TTA GGA AAG ATA GTC 933
Ser Arg Thr Gln Trp Asn Asn Thr Leu Gly Lys Ile Val
300 305 310

ATA AAA TTA AGA GAA CAA TTT AGA AAA CAA TTT GGA GAA 972
Ile Lys Leu Arg Glu Gln Phe Arg Lys Gln Phe Gly Glu
315 320

AAA ACA ATA GTC TTT AAT CGA TCC TCA GGA GGG GAC CCG 1011
Lys Thr Ile Val Phe Asn Arg Ser Ser Gly Gly Asp Pro
325 330 335

GAA ATT GCA ATG CAC AGT TTT AAT TGT GGA GGG GAA TTT 1050
Glu Ile Ala Met His Ser Phe Asn Cys Gly Gly Glu Phe
340 345 350

TTC TAC TGT AAC ACA ACA GCA CTG TTT AAT AGT ACC TGG 1089
Phe Tyr Cys Asn Thr Thr Ala Leu Phe Asn Ser Thr Trp
355 360

AAT GTT ACT AAA GGG TTG AAT AAC ACT GAA GGA AAT AGC 1128
Asn Val Thr Lys Gly Leu Asn Asn Thr Glu Gly Asn Ser
365 370 375

ACA GGG GAT GAA AAT ATC ATA CTC CCA TGT AGA ATA AAA 1167
Thr Gly Asp Glu Asn Ile Ile Leu Pro Cys Arg Ile Lys
380 385

CAA ATT ATA AAC ATG TGG CAG GAA GTA GGA AAA GCA ATG 1206
Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met
390 395 400

TAT GCC CCT CCC ATC AGT GGA CAA ATT AGA TGT TCA TCA 1245
Tyr Ala Pro Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser
405 410 415

AAT ATT ACA GGG CTG CTA CTA ACA AGA CAT GGT GGT AGT 1284
Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Ser
420 425

AAG AAC GAG AGC ATC ACC ACC GAG GTC TTC AGA CCT GGA 1323
Lys Asn Glu Ser Ile Thr Thr Glu Val Phe Arg Pro Gly
430 435 440

GGA GGA GAT ATG AGG GAC AAT TGG AGA AGT GAA TTA TAT 1362
Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr
445 450

AAA TAT AAA GTA GTA AAA ATT GAA CCA TTA GGA GTA CCG 1401
Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala
455 460 465

CCC ACC AAG GCA AAG AGA AGA GTG GTG CAG AGA GAA AAA 1440
Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys
470 475 480

AGA GCA GTG GGA ACA ATA GGA GCT ATG TTC CTT GGG TTC 1479
Arg Ala Val Gly Thr Ile Gly Ala Met Phe Leu Gly Phe
485 490

TTA GGA GCA TAA AGC TTC TAG A 1501
Leu Gly Ala Xaa Ser Phe Xaa
495 500


CLONE C7.2

GG GAA TTC GGA TCC GGG GTA CCT GTG TGG AAG GAA GCA 38
Glu Phe Gly Ser Gly Val Pro Val Trp Lys Glu Ala
1 5 10

ACC ACC ACT CTA TTC TGT GCA TCA GAT GCT AGA GCA TAT 77
Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Arg Ala Tyr
15 20 25

GAC ACA GAG GTA CAT AAT GTT TGG GCC ACA CAT GCC TGT 116
Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys
30 35

GTA CCC ACA GAC CCT AGT CCA CAA GAA GTA GTT TTG GAA 155
Val Pro Thr Asp Pro Ser Pro Gln Glu Val Val Leu Glu
40 45 50

AAT GTG ACA GAA AAT TTT AAC ATG TGG AAA AAT AAC ATG 194
Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn Met
55 60

GTA GAA CAA ATG CAT GAG GAT ATA ATT AGT TTA TGG GAT 233
Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp
65 70 75

CAA AGC TTA AAG CCA TGT GTA AAA TTA ACC CCA CTC TGT 272
Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys
80 85 90

GTT ACT TTA AAT TGC AGT GAT TAT AGG AAT GCT ACT GAT 311
Val Thr Leu Asn Cys Ser Asp Tyr Arg Asn Ala Thr Asp
95 100

TAT AAG AAT GCT ACT GAT ACC ACT AGT AGT AAC GAG GGA 350
Tyr Lys Asn Ala Thr Asp Thr Thr Ser Ser Asn Glu Gly
105 110 115

AAG ATG GAG AGA GGA GAA ATA AAA AAC TGC TCT TTC AAT 389
Lys Met Glu Arg Gly Glu Ile Lys Asn Cys Ser Phe Asn
120 125

ATT ACC ACA AGC ATA AAA AAT AAG ATG CAG AAA GAA TAT 428
Ile Thr Thr Ser Ile Lys Asn Lys Met Gln Lys Glu Tyr
130 135 140

GCA CTT TTC TAT AAA CTT GAT ATA GTA CCA ATA GAT AAT 467
Ala Leu Phe Tyr Lys Leu Asp Ile Val Pro Ile Asp Asn
145 150 155

ACA AGC TAT ACA TTG ATA AGT TGT AAC ACC TCA GTC ATT 506
Thr Ser Tyr Thr Leu Ile Ser Cys Asn Thr Ser Val Ile
160 165

ACA CAG GCC TGT CCA AAG GTA TCC TTT GAA CCA ACT CCC 545
Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Thr Pro
170 175 180

ATA CAT TAT TGT GCT CCG GCT GGT TTT GCG ATT CTA AAG 584
Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys
185 190

TGT AAT GAT AAG AAG TTC AGT GGA AAA GGA GAA TGT AAA 623
Cys Asn Asp Lys Lys Phe Ser Gly Lys Gly Glu Cys Lys
195 200 205

AAT GTC AGC ACA GTA CAA TGT ACA CAT GGA ATT AGG CCA 662
Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro
210 215 220

GTA GTA TCA ACT CAA CTC CTG TTA AAT GGC AGT CTA GCA 701
Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala
225 230

GAA GAA GAG GTG GTA ATT AGA TCT GAC AAT TTC ATA GAC 740
Glu Glu Glu Val Val Ile Arg Ser Asp Asn Phe Ile Asp
235 240 245

AAT ACT AAA ACC ATA ATA GTA CAG CTG AAA GAA TCT GTA 779
Asn Thr Lys Thr Ile Ile Val Gln Leu Lys Glu Ser Val
250 255

GAA ATT AAT TGT ATA AGA CCC AAC AAT AAT ACA AGA AAA 818
Glu Ile Asn Cys Ile Arg Pro Asn Asn Asn Thr Arg Lys
260 265 270

GGT ATA CAT ATA GGA CCA GGG AGA GCA TGG TAT GCA ACA 857
Gly Ile His Ile Gly Pro Gly Arg Ala Trp Tyr Ala Thr
275 280 285

GGA GAA ATA GTA GGA GAT ATA AGA AAG GCA TAT TGT AAC 896
Gly Glu Ile Val Gly Asp Ile Arg Lys Ala Tyr Cys Asn
290 295

ATT AGT AGA ACA AAA TGG AAT AAC ACT TTA ATA CAG ATA 935
Ile Ser Arg Thr Lys Trp Asn Asn Thr Leu Ile Gln Ile
300 305 310

GCT AAC AAA TTA AAA GAA AAA TAT AAT ACA ACA ATA ACC 974
Ala Asn Lys Leu Lys Glu Lys Tyr Asn Thr Thr Ile Ser
315 320

TTT AAT CGA TCC TCA GGA GGG GAC CCA GAA ATT GTA ACC 1013
Phe Asn Arg Ser Ser Gly Gly Asp Pro Glu Ile Val Thr
325 330 335

CAT AGT TTT AAT TGT GGA GGG GAG TTT TTC TAG TGT GAT 1052
His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asp
340 345 350

TCA ACA CAA CTG TTT AAT AGT ACT TGG AAT TTA AAT GGT 1091
Ser Thr Gln Leu Phe Asn Ser Thr Trp Asn Leu Asn Gly
355 360

ACT TGG AAT TTT ACT GCA GGG TCA AAT GAA ACT GAA GGC 1130
Thr Trp Asn Phe Thr Ala Gly Ser Asn Glu Thr Glu Gly
365 370 375

ATC ACA CTC CCA TGC AGA ATA AAA CAA ATT ATA AAC 1169
Asn Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn
380 385

AGG TGG CAG GAA GTA GGG AAA GCA ATG TAT GCC CCT CCC 1208
Arg Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro
390 395 400

ATC AGT GGA CAA ATA AAA TGC TCA TCA AAC ATT ACA GGG 1247
Ile Ser Gly Gln Ile Lys Cys Ser Ser Asn Ile Thr Gly
405 410 415

ATG ATA TTA ACA AGG GAT GGT GGT AAC GAG AAC AAT AAT 1286
Met Ile Leu Thr Arg Asp Gly Gly Asn Glu Asn Asn Asn
420 425

GAG AGC AGT ACT ACT GAG ACC TTC AGA CCG GGA GGA GGA 1325
Glu Ser Ser Thr Thr Glu Thr Phe Arg Pro Gly Gly Gly
430 435 440

GAT ATG AGG AAC AAT TGG AGA AGT GAA TTA TAT AAA TAT 1364
Asp Met Arg Asn Asn Trp Arg Ser Glu Leu Tyr Lys Tyr
445 450

AAA GTA GTA AAA ATT GAA CCA TTA GGA GTA GCA CCC ACC 1403
Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr
455 460 465

AAG GCA AAG AGA AGA GTG GTG CAG AGA GAA AAA AGA CCA 1442
Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala
470 475 480

GTG GGA GCG CTA GGA GCT ATG TTC CTT GGG TTC TTA GGA 1481
Val Gly Ala Leu Gly Ala Met Phe Leu Gly Phe Leu Gly
485 490

GCA TAA AGC TTC TAG ACC GAG TCT AGA GGA TCC 1514
Ala Xaa Ser Phe Xaa Thr Asp Ser Arg Gly Ser
495 500 504

CLONE C7.10

GTA CCT GTG TGG AAG GAA GCA ACC ACC ACT CTA TTC 37
Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe
1 5 10

TGT GCA TCA GAT GCT AGA GCA TAT GAC ACA GAG GTA CAT 76
Cys Ala Ser Asp Ala Arg Ala Tyr Asp Thr Glu Val His
15 20 25

AAT GTT TGG GCC ACA CAT GCC TGT GTA CCC ACA GAC CCT 115
Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro
30 35

AGT GCA CAA GAA GTA TTT TTG GGA AAT GTG ACA GAA AAT 154
Ser Pro Gln Glu Val Phe Leu Gly Asn Val Thr Glu Asn
40 45 50

TTT AAT ATG TGG AAA AAT AAC ATG GTA GAA CAA ATG TAT 193
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met Tyr
55 60

GAG GAT ATA ATT AGT TTA TGG GAT CAA AGC TTA AAG CCA 232
Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro
65 70 75

TGT GTA AAA TTA ACC CCA CTC TGT GTT ACT TTA AAT TGC 271
Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys
80 85 90

AGT GAT TAT AGG AAT GCT ACT GAT TAT AAG AAT GCT ACT 310
Ser Asp Tyr Arg Asn Ala Thr Asp Tyr Lys Asn Ala Thr
95 100

GAT ACC ACT AGT AGT AAC GAG GGA AAG ATG GAG AGA GGA 349
Asp Thr Thr Ser Ser Asn Glu Gly Lys Met Glu Arg Gly
105 110 115

GAA ATA AAA AAC TGC TCT TTC AAT ATC ACC ACA AGC ATA 388
Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile
120 125

AAA AAT AAG ATG CAG AAA GAA TAT GCA CTT TTC TAT AAA 427
Lys Asn Lys Met Gln Lys Glu Tyr Ala Leu Phe Tyr Lys
130 135 140

CTT AAT ATA GTA CCA ATA GAT AAT ACA AGC TAT ACA TTG 466
Leu Asn Ile Val Pro Ile Asp Asn Thr Ser Tyr Thr Leu
145 150 155

ATA AGT TGT AAC ACC TCA GTC ATT ACA GAG GCC TGT CCA 505
Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro
160 165

AAG GTA TCC TTT GAA CCA ATT CCC ATA CAT TAT TGT GCT 544
Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala
170 175 180

CCG GCT GGT TTT GCG ATT CTA AAG TGT AAT GAT AAG AAG 583
Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Lys
185 190

TTC AGT GGA AAA GGA GAA TGT AAA AAT GTC AGC ACA GTA 622
Phe Ser Gly Lys Gly Glu Cys Lys Asn Val Ser Thr Val
195 200 205

GAA TGT ACA CAT GGA ATT AGG CCA GTA GTA TCA ACT CAA 661
Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln
210 215 220

CTG CTG TTA AAT GGC AGT CTA GCA GAA GAA GAG GTG GTA 700
Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val
225 230

ATT AGA TCT GAC AAT TTC ACA GAC AAT ACT AAA ACC ATA 739
Ile Arg Ser Asp Asn Phe Thr Asp Asn Thr Lys Thr Ile
235 240 245

ATA GTA GAG CTG AAA GAA TCT GTA GAA ATT AAT TGT ATA 778
Ile Val Gln Leu Lys Glu Ser Val Glu Ile Asn Cys Ile
250 255

AGA CCC AAC AAT AAT ACA AGA AAA GGT ATA CAT ATA GGA 817
Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His Ile Gly
260 265 270

CCA GGG AGA GCA TGG TAT GCA ACA GGA GAA ATA GTA GGA 856
Pro Gly Arg Ala Trp Tyr Ala Thr Gly Glu Ile Val Gly
275 280 285

GAT ATA AGA CAG GCA TAT TGT AAC ATT AGT AGA ACA AAA 895
Asp Ile Arg Gln Ala Tyr Cys Asn Ile Ser Arg Thr Lys
290 295

TGG AAT AAC ACT TTA ATA CAG ATA GCT AAC AAA TTA AAA 934
Trp Asn Asn Thr Leu Ile Gln Ile Ala Asn Lys Leu Lys
300 305 310

GAA AAA TAT AAT ACA ACA ATA AGC TTT AAT CGA TCC TCA 973
Glu Lys Tyr Asn Thr Thr Ile Ser Phe Asn Arg Ser Ser
315 320

GGA GGG GAC CCA GAA ATT GTA ACC CAT AGT TTT AAT TGT 1012
Gly Gly Asp Pro Glu Ile Val Thr His Ser Phe Asn Cys
325 330 335

GGA GGG GAA TTT TTC TAC TGT AAT TCA ACA CAA CTG TTT 1051
Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln Leu Phe
340 345 350

AAT AGT ACT TGG AAT TTA AAT GGT ACT TGG AAT TTT ACT 1090
Asn Ser Thr Trp Asn Leu Asn Gly Thr Trp Asn Phe Thr
355 360

GCA GGG TCA AAT GAA ACT GAA GGC AAT ATC ACA CTC CCA 1129
Ala Gly Ser Asn Glu Thr Glu Gly Asn Ile Thr Leu Pro
365 370 375

TGC AGA ATA AAA CAA ATT ATA AAC AGG TGG CAG GAA GTA 1168
Cys Arg Ile Lys Gln Ile Ile Asn Arg Trp Gln Glu Val
380 385

GGA AAA GCA ATG TAT GCC CCT CCC ATC AGT GGA CAA ATA 1207
Gly Lys Ala Met Tyr Ala Pro Pro Ile Ser Gly Gln Ile
390 395 400

AGA TGC TCA TCA AAC ATT ACA GGG ATG ATA TTA ACA AGG 1246
Arg Cys Ser Ser Asn Ile Thr Gly Met Ile Leu Thr Arg
405 410 415

GAT GGT GGT AAC GAG AAC AAT AAT GAG AGC AGT ACT ACT 1285
Asp Gly Gly Asn Glu Asn Asn Asn Glu Ser Ser Thr Thr
420 425

GAG ACC TTC AGA CCC GGA GGA GGA GAT ATG AGG AAC AAT 1324
Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asn Asn
430 435 440

TGG AGA AGT GAA TTA TAT AAA TAT AAA GTA GTA AAA ATT 1363
Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile
445 450

GAG CCA TTA GGA GTA GCA CCC ACC GAC TCT AGA GGA TCC 1402
Glu Pro Leu Gly Val Ala Pro Thr Asp Ser Arg Gly Ser
455 460 465

TCT AGA 1408
Ser Arg
469

CLONE C11.5

GAG GTA CCT GTG TGG AAA GAA GCA ACC ACT ACT CTA 36
Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu
1 5 10

TTT TGT GCA TCA GAT GCT AAA GCA TAT GAC ACA GGG GTG 75
Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Gly Val
15 20 25

CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA CCC ACA GAC 114
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp
30 35

CCC AAC CCA CAA GAA ATA GAA TTG GTA AAT GTG ACA GAA 153
Pro Asn Pro Gln Glu Ile Glu Leu Val Asn Val Thr Glu
40 45 50

GAT TTT AAC ATG TGG AAA AAT AAA ATG GTA GAC CAG ATG 192
Asp Phe Asn Met Trp Lys Asn Lys Met Val Asp Gln Met
55 60

CAT GAG GAT ATA ATC AGT TTA TGG GAT GAA AGC CTA AAG 231
His Glu Asp Ile Ile Ser Leu Trp Asp Glu Ser Leu Lys
65 70 75

CCA TGT GTA AAG TTA ACC CCA CTT TGT GTT ACT CTA AAC 270
Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn
80 85 90

TGC AGT GAT GTG AAC AAT TCC ACA AAT CCT AAT GAT ACT 309
Cys Ser Asp Val Asn Asn Ser Thr Asn Pro Asn Asp Thr
95 100

AAT ACT AAT TCC ACT AAT ACT ACT TCC TCT ACT CCT ACG 348
Asn Thr Asn Ser Thr Asn Thr Thr Ser Ser Thr Pro Thr
105 110 115

GCC ACT ACT AGT AGC GAG GAA AAG ATG GAG AAG GGA GAA 387
Ala Thr Thr Ser Ser Glu Glu Lys Met Glu Lys Gly Glu
120 125

ATA AAA AAC TGC TCT TTC AAT ATC ACC ACA CAC ATG AAA 426
Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr His Met Lys
130 135 140

GAT AAG GCA CAG AAA GAA TAT GCA CTT TTT TAT AAA CTT 465
Asp Lys Ala Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu
145 150 155

GAT ATA GTA CCA ATA GAT GAT AAT AAT GCC AGC TAT AGG 504
Asp Ile Val Pro Ile Asp Asp Asn Asn Ala Ser Tyr Arg
160 165

TTG ATA AGT TGT AAT ACC TCA GAC ATT ACA CAG GCC TGT 543
Leu Ile Ser Cys Asn Thr Ser Asp Ile Thr Gln Ala Cys
170 175 180

GCA AAG GTG ACC TTT GAG CCA ATT CCC ATA CAT TAT TGT 582
Pro Lys Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys
185 190

GCC CCG GCT GGT TTT GCG ATT CTA AAG TGT AAA GAT AAG 621
Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys
195 200 205

AAG TTC AAT GGA ACA GGA CCA TGT TCA AAG GTC AGC ACA 660
Lys Phe Asn Gly Thr Gly Pro Cys Ser Lys Val Ser Thr
210 215 220

GTA CAA TGT ACA CAT GGA ATT AGG CCA GTA GTA TCA ACT 699
Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
225 230

CAA CTG TTG TTA AAT GGC AGT CTT GCA GAA GAA GAA GTA 738
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val
235 240 245

GTA ATT AGA TCT GTC AAT TTC ACA GAC AAT GCT AAA ATC 777
Val Ile Arg Ser Val Asn Phe Thr Asp Asn Ala Lys Ile
250 255

ATA ATA GTA CAG CTG AAA GAA CCT GTA GCA ATT AAT TGT 816
Ile Ile Val Gln Leu Lys Glu Pro Val Ala Ile Asn Cys
260 265 270

ACA AGA CCC AAC AAC AAT ACA AGA AAA GGT ATA CAT CTA 855
Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His Leu
275 280 285

GGA CCA GGG AGC ACA TTT TAT ACA ACA GGA GAA ATA ATA 894
Gly Pro Gly Ser Thr Phe Tyr Thr Thr Gly Glu Ile Ile
290 295

GGA GAC ATA AGA AAA GCA TAT TGC AAG ATT AGT AAA GAA 933
Gly Asp Ile Arg Lys Ala Tyr Cys Lys Ile Ser Lys Glu
300 305 310

AAA TGG AAT AAC ACT TTA AGA CAG GTA GTT AAA AAA TTA 972
Lys Trp Asn Asn Thr Leu Arg Gln Val Val Lys Lys Leu
315 320

AGA GAA CAA TTT GGG AAT AAA ACA ATA ATT TTT AAT CGA 1011
Arg Glu Gln Phe Gly Asn Lys Thr Ile Ile Phe Asn Arg
325 330 335

TCC TCA GGA GGG GAC CCA GAA ATT GTA ATG CAC AGT TTT 1050
Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe
340 345 350

AAC TCT GGA GGG GAG TTT TTC TAC TGT AAT ACA ACA CAA 1089
Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Gln
355 360

CTG TTT AAT AGT ACT TGG AAT AAT ACT GAA GGG ACA AAT 1128
Leu Phe Asn Ser Thr Trp Asn Asn Thr Glu Gly Thr Asn
365 370 375

AGC ACT GAA GGA AAT AGC ACA ATC ACA CTC CCA TGC AGA 1167
Ser Thr Glu Gly Asn Ser Thr Ile Thr Leu Pro Cys Arg
380 385

ATA AAA CAA ATT ATA AAT ATG TGG CAG GAA GTA GGA AAA 1206
Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys
390 395 400

GCA ACG TAT GCC CCT CCC ATC AGA GGA CGA ATT AGA TGC 1245
Ala Thr Tyr Ala Pro Pro Ile Arg Gly Arg Ile Arg Cys
405 410 415

ATA TCA AAT ATT ACA GGA CTG CTA TTA ACA AGA GAT GGT 1284
Ile Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly
420 425

GGT AGG AAT GTC ACA AAC AAT ACC GAA ACC TTC AGA CCT 1323
Gly Arg Asn Val Thr Asn Asn Thr Glu Thr Phe Arg Pro
430 435 440

GGA GGA GGA GAC ATG AGG GAC AAT TGG AGA AGT GAA TTA 1362
Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu
445 450

TAT AAA TAT AAA GTA GTA AAA GTT GAA CCA TTA GGA ATA 1401
Tyr Lys Tyr Lys Val Val Lys Val Glu Pro Leu Gly Ile
455 460 465

GCA CCC AGC AAG GCA AAG AGA AGA GTG GTG CAC AGA GAC 1440
Ala Pro Thr Lys Ala Lys Arg Arg Val Val His Arg Asp
470 475 480

AAA AGA GCA GCA CTA GGA GCC TTG TTC CTT GGG TTC TTA 1479
Lys Arg Ala Ala Leu Gly Ala Leu Phe Leu Gly Phe Leu
485 490

GGA GGA TAA AAG CTT CTA GA 1499
Gly Ala Xaa Lys Leu Leu
495 499

40 . 

CLONE C11.7

GAG GTA CCT GTA TGG AAA GAA GCA ACC ACT ACT CTA   36
Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu
                5                   10

TTT TGT GCA TCA GAT GCT AAA GCA TAT GAC ACA GAG GTG   75
Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val
        15                  20                  25

CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA CCC ACA GAC   114
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp
                30                  35

CCC AAC CCA CAA GAA ATA GAA TTG GTA AAT GTG ACA GAA   153
Pro Asn Pro Gln Glu Ile Glu Leu Val Asn Val Thr Glu
    40                  45                  50

GAT TTT AAC ATG TGG AAA AAT AAA ATG GTA GAC CAG ATG   192
Asp Phe Asn Met Trp Lys Asn Lys Met Val Asp Gln Met
            55                  60

CAT GAG GAT ATA ATC AGT TTA TGG GAT GAA AGC CTA AAG   231
His Glu Asp Ile Ile Ser Leu Trp Asp Glu Ser Leu Lys
65                  70                  75

CCA TGT GTA AAG TTA ACC CCA CTT TGT GTT ACT CTA AAC   270
Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn
        80                  85                  90

TGC AGT GAT GTG AAC AAT TCC ACA AAT CCT AAT GAT ACT   309
Cys Ser Asp Val Asn Asn Ser Thr Asn Pro Asn Asp Thr
                95                  100

AAT ACT AAT TCC ACT AAT ACT ACT TCC TCT ACT CCT ACG   348
Asn Thr Asn Ser Thr Asn Thr Thr Ser Ser Thr Pro Thr
    105                 110                 115

GCC ACT ACT AGT AGC GAG GAA AAG ATG GAG AAG GGA GAA   387
Ala Thr Thr Ser Ser Glu Glu Lys Met Glu Lys Gly Glu
            120                 125

ATA AAA AAC TGC TCT TTC AAT ATC ACC ACA CAC ATG AAA   426
Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr His Met Lys
130                 135                 140

GAT AAG GTA CAG AAA GAA TAT GCA CTT TTT TAT AAA CTT   465
Asp Lys Val Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu
        145                 150                 155

GAT ATA GTA CCA ATA GAT GAT AAT AAT ACC AGC TAT AGG   504
Asp Ile Val Pro Ile Asp Asp Asn Asn Thr Ser Tyr Arg
                160                 165

TTG ATA AGT TGT AAT ACC TCA GTC ATT ACA CAG GCC TGT   543
Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys
    170                 175                 180

CCA ATG GTG ACC TTT GAG CCA ATT GCC ATA CAT TAT TGT   582
Pro Met Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys
            185                 190

GCC CCG GCT GGT TTT GCG ATT CTA AAG TGT AAA GAT AAG   621
Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys
195                 200                 205

AAG TTC AAT GGA ACA GGA CCA TGT TCA AAG GTC AGC ACA   660
Lys Phe Asn Gly Thr Gly Pro Cys Ser Lys Val Ser Thr
        210                 215                 220

GTA CAA TGT ACA CAT GGA ATT AGG CCA GTA GTA TCA ACT   699
Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
                225                 230

CAA CTG TTG TTA AAT GGC AGT CTT GCA GAA GAA GAA GTA   738
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val
    235                 240                 245

GTA ATT AGA TCT GTC AAT TTC ACA GAC AAT GCT AAA ATC   777
Val Ile Arg Ser Val Asn Phe Thr Asp Asn Ala Lys Ile
            250                 255

ATA ATA GTA CAG CTG AAA GAA CCT GTA GCA ATT AAT TGT   816
Ile Ile Val Gln Leu Lys Glu Pro Val Ala Ile Asn Cys
260                 265                 270

ACA AGA CCC AAC AAC AAT ACA AGA AAA GGT ATA CAT CTA   855
Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His Leu
        275                 280                 285

GGA CCA GGG AGC ACA TTT TAT ACA ACA GGA GAA ATA ATA   894
Gly Pro Gly Ser Thr Phe Tyr Thr Thr Gly Glu Ile Ile
                290                 295

GGA GAC ATA AGA AAA GCA TAT TGC AAG ATT AGT AAA GAA   933
Gly Asp Ile Arg Lys Ala Tyr Cys Lys Ile Ser Lys Glu
    300                 305                 310

AAA TGG AAT AAC ACT TTA AGA CAG GTA GTT AAA AAA TTA   972
Lys Trp Asn Asn Thr Leu Arg Gln Val Val Lys Lys Leu
            315                 320

AGA GAA CAA TTT GGG AAT AAA ACA ATA ATT TTT AAT CGA   1011
Arg Glu Gln Phe Gly Asn Lys Thr Ile Ile Phe Asn Arg
325                 330                 335

TCC TCA GGA GGG GAC CCA GAA ATT GTA ATG CAC AGT TTT   1050
Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe
        340                 345                 350

AAC TGT GGA GGG GAG TTT TTC TAC TGT AAT ACA ACA CAA   1089
Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Gln
                355                 360

CTG TTT AAT AGT ACT TGG AAT AAT ACT GAA GGG ACA AAT   1128
Leu Phe Asn Ser Thr Trp Asn Asn Thr Glu Gly Thr Asn
    365                 370                 375

AGC ACT GAA GGA AAT AGC ACA ATC ACA CTC CCA TGC AGA   1167
Ser Thr Glu Gly Asn Ser Thr Ile Thr Leu Pro Cys Arg
            380                 385

ATA AAA CAA ATT ATA AAT ATG TGG CAG GAA CTA GGA AAA   1206
Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys
390                 395                 400

GCA ACG TAT GCC CCT CCC ATC AGA GGA CGA ATT AGA TGC   1245
Ala Thr Tyr Ala Pro Pro Ile Arg Gly Arg Ile Arg Cys
        405                 410                 415

ATA TCA AAT ATT ACA GGA CTG CTA TTA ACA AGA GAT GGT   1284
Ile Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly
                420                 425

GGT AGG AAT GTC ACA AAC AAT ACC GAN NCC TTC AGA CCT   1323
Gly Arg Asn Val Thr Asn Asn Thr Xaa Xaa Phe Arg Pro
    430                 435                 440

GGA GGA GGA GAC ATG AGG GAC AAT TGG AGA AGT GAA TTA   1362
Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu
            445                 450

TAT AAA TAT AAA GTA GTA AAA GTT GAA CCA TTA GGA ATA   1401
Tyr Lys Tyr Lys Val Val Lys Val Glu Pro Leu Gly Ile
455                 460                 465

GCA CCC ACC AAG GCA AAG AGA AGA GTG GTG CAC AGA GAC   1440
Ala Pro Thr Lys Ala Lys Arg Arg Val Val His Arg Asp
        470                 475                 480

AAA AGA GCA GCA CTA GGA GCT TTG TTC CTT GGG TTC TTA   1479
Lys Arg Ala Ala Leu Gly Ala Leu Phe Leu Gly Phe Leu
                485                 490

GGA GCA TAA AAG CTT CTA GA   1499
Gly Ala Xaa Lys Leu Leu
    495             499

CLONE C10.5

G GTA CCT GTG TGG AAA GAA GCA AAC ACA ACT CTA TTT   37
  Val Pro Val Trp Lys Glu Ala Asn Thr Thr Leu Phe
  1               5                   10

TGT GCA TCA GAT GCT AAA GCA TAT GAT AGA GAA GTA CAT   76
Cys Ala Ser Asp Ala Lys Ala Tyr Asp Arg Glu Val His
        15                  20                  25

AAT GTT TGG GCA ACA CAT GCC TGT GTA CCC ACA GAC CCC   115
Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro
                30                  35

AAC CCA CAA GAA ATA GTA TTG GGA AAT GTG ACA GAA AAT   154
Asn Pro Gln Glu Ile Val Leu Gly Asn Val Thr Glu Asn
    40                  45                  50

TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAA ATG CAT   193
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His
            55                  60

GAG GAT ATA ATC AAT TTA TGG GAT CAA AGC TTA AAC CCA   232
Glu Asp Ile Ile Asn Leu Trp Asp Gln Ser Leu Lys Pro
65                  70                  75

TGT GTA AAG TTA ACT CCA CTC TGT GTT ACT TTA AAG TGC   271
Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Lys Cys
        80                  85                  90

AAG GAT CTG GAG AGG AAT ACT ACC TAT AAT AGC ACT ATT   310
Lys Asp Leu Glu Arg Asn Thr Thr Tyr Asn Ser Thr Ile
                95                  100
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He 
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Ala ' 
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CCA TGT 


AAA 
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TGT 
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GGA. ATT AGG 
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Gin 
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Val 










225 






230 








GTA 


ATC 


AGA 


TCT 


GCC 


AAT 


TTC ACA 


GAC 


AAT 


GCT 


AAA 


ft 
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He 
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He 
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He 
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, : : : Th ; r : . ; 


Glu 












420 










425 








ATC TTC AGA CGT GGA GGA GGA GAT ATG AGG GAC AAT TGG   1324
Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
    430                 435                 440

AGA AGT GAA TTA TAT AGA TAT AAA GTA GTA AAA ATT GAA   1363
Arg Ser Glu Leu Tyr Arg Tyr Lys Val Val Lys Ile Glu
            445                 450

CCA TTA GGA ATA GCA CCC ACC AGG GCA AAC AGA AGA GTG   1402
Pro Leu Gly Ile Ala Pro Thr Arg Ala     Arg Arg Val
455                 460                 465

GTG CAG AGA GAA AAA AGA GCA GTA GGA CTA GGA GCT TTG   1441
Val Gln Arg Glu Lys Arg Ala Val Gly Leu Gly Ala Leu
        470                 475                 480

TTC CTT GGG T TCTTAGGAG CATAAAGCTT CTAGA   1475
Phe Leu Gly
        483

CLONE C10.7

G GTA CCT GTG TGG AAA GAA GCA AAC ACA ACT CTA TTT   37
  Val Pro Val Trp Lys Glu Ala Asn Thr Thr Leu Phe
  1               5                   10

TGT GCA TCA GAT GCT AAA GCA TAT GAT AGA GAA GTA CAT   76
Cys Ala Ser Asp Ala Lys Ala Tyr Asp Arg Glu Val His
        15                  20                  25

AAT GTT TGG GCA ACA CAT GCC TGT GTA CCC ACA GAC CCC   115
Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro
                30                  35

AAC CCA CAA GAA ATA GTA TTG GGA AAT GTG ACA GAA AAT   154
Asn Pro Gln Glu Ile Val Leu Gly Asn Val Thr Glu Asn
    40                  45                  50

TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAA ATG CAT   193
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His
            55                  60

GAG GAT ATA ATC AAT TTA TGG GAT CAA AGC TTA AAG CCA   232
Glu Asp Ile Ile Asn Leu Trp Asp Gln Ser Leu Lys Pro
65                  70                  75

TGT GTA AAG TTA ACT CCA CTC TGT GTT ACT TTA AAG TGC   271
Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Lys Cys
        80                  85                  90

AAG GAT CTG GAG AGG AAT ACT ACC TAT AAT AGC ACT ATT   310
Lys Asp Leu Glu Arg Asn Thr Thr Tyr Asn Ser Thr Ile
                95                  100

ACC AAT AAT AGT AGT TTG GAG GGA CTA AGA GAA CAA ATG   349
Thr Asn Asn Ser Ser Leu Glu Gly Leu Arg Glu Gln Met
    105                 110                 115

ACA AAC TGC TCT TTC AAC ATC ACC ACA AGT ATA AGA GAT   388
Thr Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp
            120                 125

AAG GTG CAG AAA GAA TAT GCA CTT TTG TAT AAA CTT GAT   427
Lys Val Gln Lys Glu Tyr Ala Leu Leu Tyr Lys Leu Asp
130                 135                 140

GTA GTA CCA ATA GAA GAA GAT GAC AAT ACT AGC TAT AGA   466
Val Val Pro Ile Glu Glu Asp Asp Asn Thr Ser Tyr Arg
        145                 150                 155

TTG ATA ACT TGT AAC ACC TCA GTC ATT ACA CAG GCT TGT   505
Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys
                160                 165

CCA AAG ACA TCC TTT GAG CCA ATT CCC ATA CAT TAT TGT   544
Pro Lys Thr Ser Phe Glu Pro Ile Pro Ile His Tyr Cys
    170                 175                 180

GCC CCG GCT GGT TTT GCG ATT CTA AAG TGT AAT GAT AAG   583
Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys
            185                 190
AAG TTC AAT GGA AGA GGA CCA TGT AAA AAT GTC AGC       622
Lys Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr
195                 200                 205

GTA CAA TGT ACA CAT GGA ATT AGG CCA GTA GTA TCA ACT   661
Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
        210                 215                 220

CAA CTG TTG TTA AAT GGC ACT CTA GCA GAA GAA GAG GTA   700
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val
                225                 230

GTA ATC AGA TCT GCC AAT TTC ACA GAC AAT GCT AAA ACC   739
Val Ile Arg Ser Ala Asn Phe Thr Asp Asn Ala Lys Thr
    235                 240                 245

ATA ATA GTA CAT CTA AAT GAA ACT GTA AAA ATT AAT TGT   778
Ile Ile Val His Leu Asn Glu Thr Val Lys Ile Asn Cys
            250                 255

ACA AGA CTT GGC AAC AAT ACA AGA AAA AGT ATA AAT ATA   817
Thr Arg Leu Gly Asn Asn Thr Arg Lys Ser Ile Asn Ile
260                 265                 270

GGA CCA GGG AGA GTA CTC TAT GCA ACA GGA GAA ATA ATA   856
Gly Pro Gly Arg Val Leu Tyr Ala Thr Gly Glu Ile Ile
        275                 280                 285

GGA GAC ATA AGA CAA GCA CAT TCT AAC ATT AGT AGA GCA   895
Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Ala
                290                 295

CAA TGG AAT AAG ACT TTA GAA AAG GTA GTT GAC AAG TTA   934
Gln Trp Asn Lys Thr Leu Glu Lys Val Val Asp Lys Leu
    300                 305                 310

AGA AAA CAA TTT GGG GAT AAT ACA ACA ATA GCT TTT AAT   973
Arg Lys Gln Phe Gly Asp Asn Thr Thr Ile Ala Phe Asn
            315                 320

CGA TCC TCA GGA GGG GAC CCA GAA ATT GTA ATG CAC ACT   1012
Arg Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Thr
325                 330                 335

TTT AAT TGT GGA GGG GAA TTT TTC TAC TGT AAT ACA ACA   1051
Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr
        340                 345                 350

CAA CTG TTT AAT AGT ACT TGG AAT AAT ACT TGG AAG GAT   1090
Gln Leu Phe Asn Ser Thr Trp Asn Asn Thr Trp Lys Asp
                355                 360

CCT AAC AGG AGT GAC AAT ATC ACA CTC CCA TGC AGA ATA   1129
Pro Asn Arg Ser Asp Asn Ile Thr Leu Pro Cys Arg Ile
    365                 370                 375

AAA CAA ATT ATA AAC ATG TGG CAG GAA GTA GGA AAA GCA   1168
Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala
            380                 385

ATG TAC GCC CCT CCC ATC AGA GGG GAA ATT AGA TGT TCA   1207
Met Tyr Ala Pro Pro Ile Arg Gly Glu Ile Arg Cys Ser
390                 395                 400

TCA AAT ATC ACA GGG CTG CTA CTA ACA AGA GAT GGT GGT   1246
Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly
        405                 410                 415

AAT GAC GAT GGT AAT GAC ACG ACC ACA AAC AGG ACC GAG   1285
Asn Asp Asp Gly Asn Asp Thr Thr Thr Asn Arg Thr Glu
                420                 425

ATC TTC AGA CCT GGA GGA GGA GAT ATG AGG GAC AAT TGG   1324
Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
    430                 435                 440

AGA AGT GAA TTA TAT AGA TAT AAA GTA GTA AAA ATT GAA   1363
Arg Ser Glu Leu Tyr Arg Tyr Lys Val Val Lys Ile Glu
            445                 450

CCA TTA GGA ATA GCA CCC ACC AGG GCA AAG AGA AGA GTG   1402
Pro Leu Gly Ile Ala Pro Thr Arg Ala Lys Arg Arg Val
455                 460                 465

GTG CAG AGA GAA AAA AGA GCA GTA GGA CTA GGA GCT TTG   1441
Val Gln Arg Glu Lys Arg Ala Val Gly Leu Gly Ala Leu
        470                 475                 480

TTC CTT GGC TTC TTG GGA GCA TAA AGC TTC TAG A   1475
Phe Leu Gly Phe Leu Gly Ala Xaa Ser Phe Xaa
                485                 490 491
























CLONE C17.1














CTC 


GAG 


uln. 


npT . 




TPP 


ft 'ft IV 

AAA 


ry ft-ft 
UAA 


GCA 


ACC 


: ACC 


KPT 
Aw 1 


J O 


10 




Leu 


Glu 


v a x 


Pro 


va i 


Trp 


Lys 


Glu 


Ala Thr 


Thr 


Thr 








1 








c 






10 










CTA 


■TTT 


TGT 






RAT 


.PPT 


AAA 


GCA 


TAT 


GAT 


TCA 


p^p 






Leu 


Phe 


Cvs 


Ala 


Ser 


Asp 


nla 


Ly s 


Ala 


■Tyr. 


Asp Ser 


C In 










15 










20 














15 


GCA 


CAT 


AAT 


GTT 


TGG 




api 


CAT 


GCC 


TGT . 


GTA 


CCC 


MUM 


lit 




Ala 


His 


Asn 


Va 1 


Trp 


nld. 


i n r 


His 


A 1 a . 


Cys Val 


Pro 


■l nr. 














in 










.: 35 












GAC 


CCC 


AAC 


v*CA 


pat 


PUB 


O 1 ft 


GAA 


TTG. 


GAA 


AAT 


GTG 


a p ft 
ACA 


1 5 3 




Asp 


Pro 


Ash 


Pro 




G lu 




Glu 


Leu . 


Glu 


Asn 


Val 


■ TK *» 

i nr 




2 0 




40 




















SO 








GAA 


AAT 


TTT 




/VI o 




fi A A 
AAA 


AAT 


AAC 


ATG 


GTA 


GAA 


• ■ 
■ CAG 


192 




Glu 


Ash 


Phe 


Asn 


Met 


Trp 




Asn 


Asn 


Met . 


Val 


Glu 














5 5 








bO 














ATG 


CAT 


GGG 


un 1 


ATA 
nln 


Mil 


no 1 


TTA 


TGG 


GAT 


CAA 


AGC 


CTA 


7"J.', 
£. J 1 


25 


Met 


His 


Glv 


Asp 


T 1 a 
lie 


lie 


C d »- 


Leu 


Trp 


Asp 


Gin 


Ser 


Leu 






65 










in 

vu 










75 










AAG 


CCA 


TGT 


GTA 


AAA 


TTA 


ACC 


CCA 


QTP ' 


TGT 


GTT ACG 


TT ft 

TTA 






Ly s 


Pro 


cys 


Val 


Lys 


Lei* 


Thr 


Pro 


Leu* 


Cys 


val : 


Thr 


Leu 










80 










85 








90 




3 0 


AAT 


TGC 


act; 


GAC 


/—•/*» t\ 


AAT 


GTT 


ACT 


AAT 


AGC 


GAG 


AGA 


ACG 


■J \J 7 




As h 


Cvs 


thr 


Asp 


. r ro 


Asn 


Val 


Thr 


Asn 


Ser 


Glu 


Arg 


l:n r . 
























100 












ATA 


GAG 


GGG 


GGA 


. rat ■ 

VjAA 


A 1 ft. 


AAA 


AAT 


TCP 


TCT 


TTC 


AAT 


& TP ' 
AIL. 


,3 ** O 




lie 


Glu 


Gly 


Gly 


Glu 


I 16 


Lys 


Asn 


Cys 


Ser 


Phe 


Asn 


lie 




3 5 




105 








110 










115 








ACC 


ACA 


aac 


ATA 


AGA 


GAT 


AGG 


TTT 


CAG 


AAA 


GAA 


TAT 


o^*A 


38 7 




Thr 


Thr 


Asn 


lie 


Arg 


Asp 


Arg 


Phe 


Gin 


Lys 


Glu 


Tyr 


Ala 












12 b 










125 














CTT 


TTT 


TAT 


AAA 


CTT 


P R. T 

Oft 1 


GTA 


ATA 


CCA 


TTA 


GGT 


AAT 


PAT 


426 


40 


Leu 


Phe 


Tyr Lys 


Lett 


Asp 


Val 


lie 


Pro 


Leu 


Gly 


Asn 


Asp 






130. 










i i ^ 










140 










AAT 


ACT 


AGC 


TAT : 


AGG 


1 lu 


ATA : 


AGT 


TGT 


AAC 


ACC 


TCA 




4 65 




Asn 


Thr 


Ser 


Tyr 


Arg 


Leu 


lie 


Ser 


Cys 


Asn 


Thr 


Ser 


Val 




.45 






14 5 








150 










15 5 




ATT ACA CAG GCC TGT CCA AAG GTA TCC TTT GAG CCA ATT   504
Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile
                160                 165

CCC ATA CAT TAT TGT     CCG GCT GGT TTT GCG ATT CTA   543
Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu
    170                 175                 180

AAG TCT AAA GAT AAG AAG TTC AAT GGA ACA GGA CCA TGT   582
Lys Cys Lys Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys
            185                 190

ACA AAT GTC AGC ACA GTA CAA TGT ACA CAT GGA ATT AAG   621
Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys
195                 200                 205

CCA GTA GTA TCA ACT CAA CTG TTG TTA AAT GGC AGT CTA   660
Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu
        210                 215                 220

GCA GAA GAA GAC ATA GTA ATT AGA TCC GCC AAT CTC ACA   699
Ala Glu Glu Asp Ile Val Ile Arg Ser Ala Asn Leu Thr
                225                 230

GAC AAT GCT AAA AAC ATA ATA GTA CAG CTG AAT GAA TCT   738
Asp Asn Ala Lys Asn Ile Ile Val Gln Leu Asn Glu Ser
    235                 240                 245










WO98/01564 



PCT/US97/09690 





GTA ACA ATG AAT TGT ACA AGA CCC AAC AAC AAT ACA ATG   777
Val Thr Met Asn Cys Thr Arg Pro Asn Asn Asn Thr Met
            250                 255

AAA AGT ATA CAT ATA GGA CCA GGC AGA GCA TTT TAT GCA   816
Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala
260                 265                 270

ACA GGA AAC ATA ATA GGA GAT ATA AGA CAA GCA CAT TGT   855
Thr Gly Asn Ile Ile Gly Asp Ile Arg Gln Ala His Cys
        275                 280                 285

AAC ATT AGT GGA ACA AAA TGG AAT GAC ACT TTG AAA AAG   894
Asn Ile Ser Gly Thr Lys Trp Asn Asp Thr Leu Lys Lys
                290                 295

ATA GCT ATA AAA TTA AGA GAA CAA TTT AAT AAG ACA ATA   933
Ile Ala Ile Lys Leu Arg Glu Gln Phe Asn Lys Thr Ile
    300                 305                 310

GTC TTT AAT CAA TCC TCA GGA GGG GAC CCA GAA ATT GCA   972
Val Phe Asn Gln Ser Ser Gly Gly Asp Pro Glu Ile Ala
            315                 320

ACG CTC AGT TTT AAT TGT GGA GGG GAA TTT TTC TAC TGT   1011
Thr Leu Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
325                 330                 335

AAT TCA ACA CAA CTG TTT AAT AGT ACT TGG AAT AGT ACT   1050
Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Asn Ser Thr
        340                 345                 350

GGG TCA AAT AAC ACT AAA GGA AAT GAC ACA ATC ACA CTC   1089
Gly Ser Asn Asn Thr Lys Gly Asn Asp Thr Ile Thr Leu
                355                 360

CCA TGC AGA ATA AGA CAA ATT ATA AAC ATG TGG CAG AAA   1128
Pro Cys Arg Ile Arg Gln Ile Ile Asn Met Trp Gln Lys
    365                 370                 375

ATA GGA AAA GCA ATG TAT GCC CCT CCC ATC AAA GGG CAA   1167
Ile Gly Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Gln
            380                 385

ATT         TCA TCA AAT ATT ACA GGG CTA ATA TTA ACA   1206
Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Ile Leu Thr
390                 395                 400

AGA GAT GGT GGT AAC AAC AAC ATG AGC AAG ACC ACC GAG   1245
Arg Asp Gly Gly Asn Asn Asn Met Ser Lys Thr Thr Glu
        405                 410                 415

ACC     AGA CCT GGA GGA GGA GAT ATG AGG GAC AAT TGG   1284
Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
                420                 425

AGA AGT GAA TTA TAT AAA TAT AAA GTA GTA AAA ATT GAA   1323
Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu
    430                 435                 440

CCA TTA GGA GTA GCA CCC ACC AGG GCA AAG AGA AGA GTG   1362
Pro Leu Gly Val Ala Pro Thr Arg Ala Lys Arg Arg Val
            445                 450

GTG CAG AGA GAA AAA AGA GCA GTG GGA ATA GGA GCT GTG   1401
Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val
455                 460                 465

TTC CTT GGG TTC TTG GGA GCA TAA AGC TTC TAG A   1435
Phe Leu Gly Phe Leu Gly Ala Xaa Ser Phe Xaa
        470                 475         478














































CLONE C17.3

CTC GAG GTA CCT GTG TGG AAA GAA GCA ACC ACC ACT   36
Leu Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr
1               5                   10

CTA TTT TGT GCA TCA GAT GCT AAA GCA TAT GAT TCA GAG   75
Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Ser Glu
        15                  20                  25

GCA CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA CCC ACA   114
Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr
                30                  35

GAC CCC AAC CCA CAA GAA GTA GAA TTG GAA AAT GTG ACA   153
Asp Pro Asn Pro Gln Glu Val Glu Leu Glu Asn Val Thr
    40                  45                  50

GAA AAT TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAG   192
Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln
            55                  60

ATG CAT GGG GAT ATA ATT ACT TTA TGG GAT CAA AGC CTA   231
Met His Gly Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
65                  70                  75

AAG CCA TGT GTA AAA TTA ACC CCA CTC TGT GTT ACG TTA   270
Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
        80                  85                  90

AAT TGC ACT GAC CCA AAT GTT ACT AAT ACC GAG AGA ACG   309
Asn Cys Thr Asp Pro Asn Val Thr Asn Ser Glu Arg Thr
                95                  100

ATA GAG GGG GGA GAA ATA AAA AAT TGC TCT TTC AAT ATC   348
Ile Glu Gly Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile
    105                 110                 115

ACC ACA AAC ATA AGA GAT AGG TTT CAG AAA GAA TAT GCA   387
Thr Thr Asn Ile Arg Asp Arg Phe Gln Lys Glu Tyr Ala
            120                 125

GTT TTT TAT AAA CTT GAT GTA ATA CCA TTA GGT AAT GAT   426
Leu Phe Tyr Lys Leu Asp Val Ile Pro Leu Gly Asn Asp
130                 135                 140

AAT ACT AGC TAT AGG TTG ATA ACT TGT AAC ACC TCA GTC   465
Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val
        145                 150                 155

ATT ACA CAG GCC TGT CCA AAG GTA TCC TTT GAG CCA ATT   504
Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile
                160                 165

CCC ATA CAT TAT TGT GCC CCG GCT GGT TTT GCG ATT CTA   543
Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu
    170                 175                 180

AAG TGT AAA GAT AAG AAG TTC AAT GGA ACA GGA CCA TGT   582
Lys Cys Lys Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys
            185                 190

ACA AAT GTC AGC ACA GTA CAA TGT ACA CAT GGA ATT AAG   621
Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys
195                 200                 205

CCA GTA GTA TCA ACT CAA CTG TTG TTA AAT GGC AGT CTA   660
Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu
        210                 215                 220

GCA GAA GAA GAC ATA GTA ATT AGA TCC GCC AAT CTC ACA   699
Ala Glu Glu Asp Ile Val Ile Arg Ser Ala Asn Leu Thr
                225                 230

GAC AAT GCT AAA AAC ATA ATA GTA CAG CTG AAT GAA TCT   738
Asp Asn Ala Lys Asn Ile Ile Val Gln Leu Asn Glu Ser
    235                 240                 245

GTA ACA ATG AAT TGT ACA AGA CCC AAC AAC AAT ACA ATG   777
Val Thr Met Asn Cys Thr Arg Pro Asn Asn Asn Thr Met
            250                 255

AAA AGT ATA CAT ATA GGA CCA GGC AGA GCA TTT TAT GCA   816
Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala
260                 265                 270

ACA GGA AAC ATA ATA GGA GAT ATA AGA CAA GCA CAT TGT   855
Thr Gly Asn Ile Ile Gly Asp Ile Arg Gln Ala His Cys
        275                 280                 285

AAC ATT AGT GGA ACA AAA TGG AAT GAC ACT TTG AAA AAG   894
Asn Ile Ser Gly Thr Lys Trp Asn Asp Thr Leu Lys Lys
                290                 295

ATA GCT ATA AAA TTA AGA GAA CAA TTT AAT AAG ACA ATA   933
Ile Ala Ile Lys Leu Arg Glu Gln Phe Asn Lys Thr Ile
    300                 305                 310
GTC TTT AAT CAA TCC TCA                               972
Val Phe Asn Gln Ser Ser
            315                 320

ACG CTC AGT TTT AAT TGT GGA GGG GAA TTT TTC TAC TGT   1011
Thr Leu Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
325                 330                 335

AAT TCA ACA CAA CTG TTT AAT AGT ACT TGG AAT AGT ACT   1050
Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Asn Ser Thr
        340                 345                 350

GGG TCA AAT AAC ACT AAA GGA AAT GAC ACA ATC ACA CTC   1089
Gly Ser Asn Asn Thr Lys Gly Asn Asp Thr Ile Thr Leu
                355                 360

        AGA ATA AGA CAA ATT ATA AAC ATG TGG CAG AAA   1128
Pro Cys Arg Ile Arg Gln Ile Ile Asn Met Trp Gln Lys
    365                 370                 375

ATA GGA AAA GCA ATG TAT GCC CCT CCC ATC AAA GGG CAA   1167
Ile Gly Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Gln
            380                 385

ATT AGA TGT TCA TCA AAT ATT ACA GGG CTA ATA TTA ACA   1206
Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Ile Leu Thr
390                 395                 400

AGA GAT GGT GGT AAC AAC AAC ATG AGC AAG ACC ACC GAG   1245
Arg Asp Gly Gly Asn Asn Asn Met Ser Lys Thr Thr Glu
        405                 410                 415

ACC TTC AGA CCT GGA GGA GGA GAT ATG AGG GAC AAT TGG   1284
Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
                420                 425

AGA AGT GAA TTA TAT AAA TAT AAA GTA GTA AAA ATT GAA   1323
Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu
    430                 435                 440

CCA TTA GGA GTA GCA CCC ACC AGG GCA AAG AGA AGA GTG   1362
Pro Leu Gly Val Ala Pro Thr Arg Ala Lys Arg Arg Val
            445                 450

GTG CAG AGA GAA AAA AGA GCA GTG GGA ATA GGA GCT GTG   1401
Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val
455                 460                 465

TTC CTT GGG TTC TTG GGA GCA TAA AGC TTC TAG A   1435
Phe Leu Gly Phe Leu Gly Ala Xaa Ser Phe Xaa
        470                 475         478

4 0 ,. . ,. ■ ■ : " :/ ' 

addition to th^ 1 isting in ; Table i; - Figure: 3 shows 
the alignment of the amino acid sequences of the clones 
: of each of the seven isolates . corresponding residues 
from various clones are in boxes. In the figure, the 
45 kmino acid sequences are aligned against MN-rgpl20 
(SEQ. ID. NO. 29) . 

In one embodiment, a gp120 polypeptide of this
invention has the same amino acid sequence as the
sequence of one of the breakthrough isolates. In
another embodiment, the amino acid sequence is
truncated, as described in detail hereinafter. In
another embodiment, a gp120 polypeptide sequence of
this invention contains a substitution, insertion, or
deletion (alteration) of one or more amino acids in the
sequence of a breakthrough isolate. Usually, with the
exception of amino acids that are not present in a
truncated amino acid sequence and eliminate an epitope,
a gp120 polypeptide of this invention will include
alterations in the amino acid sequence of a
breakthrough isolate that do not alter the
polypeptide's ability to induce the same neutralizing
antibodies as the amino acid sequence of the isolate.
In general, substitutions in the amino acid
sequence of a gp120 polypeptide of this invention are
conservative substitutions, particularly for amino acid
residues in the V2, V3, and C4 domains of gp120, which
domains contain neutralizing epitopes. However,
non-conservative substitutions, particularly in domains
that do not contain neutralizing epitopes, are
contemplated.

Conservative substitutions replace an amino acid
with an amino acid of similar size and character. For
example, a hydrophobic residue or hydrophilic residue
is replaced with another hydrophobic residue or
hydrophilic residue, respectively. Amino acids can be
divided into the following groups: positively charged
residues (K, R, and H); negatively charged residues
(D and E); amides (N and Q); aromatics (F, Y, and W);
hydrophobics (P, G, A, V, L, I, and M); and uncharged
residues (S and T). Usually, residues within a group
are replaced with another member of the group.
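
The residue groups listed above can be expressed as a simple
lookup. The following Python sketch is illustrative only; the
names GROUPS, group_of, and is_conservative are assumptions
introduced here and are not part of the specification. Residues
that the text does not assign to a group (for example, cysteine)
are treated as having no conservative replacement.

```python
# Residue groups as listed in the text, using one-letter codes.
GROUPS = {
    "positively charged": set("KRH"),
    "negatively charged": set("DE"),
    "amide": set("NQ"),
    "aromatic": set("FYW"),
    "hydrophobic": set("PGAVLIM"),
    "uncharged": set("ST"),
}


def group_of(residue):
    """Return the name of the group containing the residue, or None."""
    code = residue.upper()
    for name, members in GROUPS.items():
        if code in members:
            return name
    return None  # residue not assigned to any group in the text


def is_conservative(original, replacement):
    """True when both residues fall in the same listed group."""
    group = group_of(original)
    return group is not None and group == group_of(replacement)


if __name__ == "__main__":
    for pair in [("K", "R"), ("D", "K"), ("F", "W"), ("C", "S")]:
        print(pair, is_conservative(*pair))
```

Under this grouping, a K to R change counts as conservative,
whereas a D to K change does not.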

In one embodiment, critical amino acid residues in
the V2, V3, and C4 domains of gp120 are identical to
the corresponding residues in a breakthrough isolate
sequence. Critical amino acid residues in the V2, V3,
and C4 domains of gp120 are described in the
experimental section. In another embodiment, all amino
acid residues in the V2, V3, and C4 domains of gp120
are identical to corresponding residues in a
breakthrough isolate sequence.



Oligonucleotide Encoding gp120 from Breakthrough
Isolates

The present invention also provides novel
oligonucleotides encoding gp120 from the breakthrough
isolates which can be used to express gp120. An
oligonucleotide of this invention encodes a polypeptide
of this invention. The oligonucleotide can be DNA or
RNA, usually DNA. Although numerous nucleotide
sequences can encode the same amino acid sequence due
to the degeneracy of the genetic code, conveniently,
the oligonucleotides of this invention include a
nucleotide sequence of a breakthrough isolate as
illustrated in Table 1 (Sequence ID Nos. 2, 4, 6, 8,
10, 12, 14, 16, 18, 20, 22, 24, 26, and 28). Usually,
an oligonucleotide of this invention is less than about
5 kilobases (kb), preferably less than about 3 kb.
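
As a minimal sketch of the degeneracy noted above, the following
Python code builds the standard genetic code, translates a
nucleotide sequence, and counts how many distinct nucleotide
sequences encode a given peptide. The function names are
hypothetical and introduced only for illustration; the example
sequence is one thirteen-codon row of the listing in Table 1.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate DNA (spaces allowed) into one-letter amino acid codes."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def synonymous_codons(amino_acid):
    """All codons of the standard code that encode the given residue."""
    return sorted(c for c, aa in CODON_TABLE.items() if aa == amino_acid)


def encoding_count(peptide):
    """Number of distinct DNA sequences that encode the peptide."""
    count = 1
    for aa in peptide:
        count *= len(synonymous_codons(aa))
    return count


if __name__ == "__main__":
    row = "AAC ATT AGT GGA ACA AAA TGG AAT GAC ACT TTG AAA AAG"
    peptide = translate(row)
    print(peptide)
    print(synonymous_codons("K"))
    print(encoding_count(peptide))
```

The count grows multiplicatively with peptide length, which is
why using the isolate's own nucleotide sequence is the
convenient choice among the many that would encode the same
polypeptide.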

To express the encoded amino acid sequence, the
oligonucleotide can be inserted into a transcription
unit. The transcription unit can be inserted into a
plasmid for production of cell lines, inserted into a
virus (e.g., vaccinia) or can be used directly as a DNA
vaccine. Suitable transcription units for production
of vaccine proteins are well known. A preferred
expression vector, designated psv16B5, is illustrated
in Sequence ID No. 32. The vector includes an HSV-1
gD1 signal sequence joined to a linker sequence. The
gp120 nucleotide sequence to be expressed starts with
the Kpn I site of the gene. Since all gp120 or gp160
sequences contain this site, any gp120 nucleotide
sequence can be analogously inserted into the vector
and expressed. The vector ends with a polyA tail from
SV40.

In addition to being useful to express a
polypeptide sequence of this invention, the
oligonucleotides of this invention can also be used in
diagnostics to detect HIV isolates. For example, the
oligonucleotide or a portion thereof encoding a
neutralizing epitope can be used in branched chain DNA
diagnostics or as a probe in in situ hybridization
studies.

Vaccine preparation

A gp120 polypeptide of this invention from a
selected breakthrough isolate(s) in a suitable carrier
is used to make a subunit vaccine. The polypeptide can
be used alone, but is generally administered in a
multivalent subunit vaccine that includes gp120 MN. In
addition to one or more gp120 polypeptides of this
invention, the vaccine generally includes the MN
polypeptide (hereinafter, MN-rgp120). The vaccine
usually includes about 3 to about 5 different gp120
polypeptides, but 30 or more different gp120
polypeptides can be used.

Preparation of gp120 polypeptides for use in a
vaccine is well known and is described hereinafter.
With the exception of the use of the selected HIV
isolate, the gp120 subunit vaccine prepared in the
method does not differ from gp120 subunit vaccines of
the prior art.

As with prior art gp120 subunit vaccines, gp120 at
the desired degree of purity and at a sufficient
concentration to induce antibody formation is mixed
with a physiologically acceptable carrier. A
physiologically acceptable carrier is nontoxic to a
recipient at the dosage and concentration employed in
the vaccine. Generally, the vaccine is formulated for
injection, usually intramuscular or subcutaneous
injection. Suitable carriers for injection include
sterile water, but preferably are physiologic salt
solutions, such as normal saline or buffered salt
solutions such as phosphate-buffered saline or Ringer's
lactate. The vaccine generally contains an adjuvant.
Useful adjuvants include QS21 (Quillaja saponaria,
commercially available from Cambridge Biotech,
Worcester, MA), which stimulates cytotoxic T-cells, and
alum (aluminum hydroxide adjuvant). Formulations with
different adjuvants which enhance cellular or local
immunity can also be used. In particular,
immunopotentiators such as cytokines can be included in
the vaccine. Examples of suitable immunopotentiating
cytokines include interleukins, such as interleukin-2
(IL-2) and interleukin-12 (IL-12), and tumor necrosis
factor-alpha (TNF-α).

Additional excipients that can be present in the
vaccine include low molecular weight polypeptides (less
than about 10 residues), proteins, amino acids,
carbohydrates including glucose or dextrans, chelating
agents such as EDTA, and other excipients that
stabilize the protein or inhibit growth of
microorganisms.

The vaccine can also contain other HIV proteins.
In particular, gp41 or the extracellular portion of
gp41 or HIV-1 core proteins such as P24, P17, and P55
can be present in the vaccine. Although the amino acid
sequence of gp41 is more conserved than that of gp120,
gp41 contains neutralizing epitopes. Preferably, any
gp41 present in the vaccine is from an HIV isolate
present in the vaccine. gp160 from an isolate used in
the vaccine can replace gp120 in the vaccine or be used
together with gp120 from the isolate. Alternatively,
gp160 from a different isolate than those in the
vaccine can additionally be present in the vaccine.

Vaccines according to the invention can also
contain one or more soluble gp120 polypeptide
sequences, or fragments thereof, in combination with an
engineered virus specifically designed to express
proteins that induce a cytotoxic T-cell response.
Suitable engineered viruses are derived from, for
example, Canary Pox virus, vaccinia viruses, attenuated
human herpes viruses (such as, e.g., herpes simplex
viruses), and Varicella Zoster. Exemplary engineered
viruses are modified to express any HIV protein capable
of inducing a cytotoxic T-cell response, such as those
described above. Typically, immunization with the
gp120/engineered virus vaccine is followed by
administration of one or more doses of the gp120
polypeptide sequence(s) to boost the immune response.
If desired, viruses can be engineered to express one or
more gp120 polypeptide sequences of the invention, or
fragments thereof, and used in vaccines with or without
soluble gp120 polypeptide sequences.

Vaccine formulations generally include a total of
about 300 to 600 µg of gp120, conveniently in about
1.0 ml of carrier. Preferred formulations include use
of twice the weight of a gp120 polypeptide in twice
as 600 µg alum. However, formulations having smaller
amounts (e.g., 50 µg per dose) are also used, generally
with alum or other adjuvants. The amount of gp120 for
any isolate present in the vaccine will vary depending
on the immunogenicity of the gp120. For example, gp120
from some strains of HIV may be less immunogenic than
gp120 from the MN strain (Sequence ID No. 29). If two
strains having different immunogenicity are used in
combination, empirical titration of the amount of each
virus would be performed to determine the percent of
the gp120 of each strain in the vaccine. For isolates
having similar immunogenicity, approximately equal
amounts of each isolate's gp120 would be present in the
vaccine. For example, in a preferred embodiment, the
vaccine includes gp120 from the MN and a strain of this
invention at concentrations of about 300 µg per strain
in about 1.0 ml of carrier. When the vaccine includes
gp120 from about 30 isolates, about 10 to about 50 µg
can be used. Methods of determining the relative
amount of an immunogenic protein in multivalent
vaccines are well known and have been used, for
example, to determine relative proportions of various
isolates in multivalent polio vaccines.
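
The arithmetic behind the equal-amount case can be sketched as
follows. This Python fragment is only an illustration under the
assumption that isolates of similar immunogenicity share the
total gp120 dose evenly; the function name equal_split_ug is
hypothetical. Strains of differing immunogenicity would instead
be apportioned by empirical titration, which this sketch does
not model.

```python
def equal_split_ug(total_ug, n_isolates):
    """Micrograms of gp120 per isolate when the total dose is shared evenly."""
    if n_isolates < 1:
        raise ValueError("at least one isolate is required")
    if total_ug <= 0:
        raise ValueError("total dose must be positive")
    return total_ug / n_isolates


if __name__ == "__main__":
    # Two strains sharing a 600 ug total in about 1.0 ml of carrier.
    print(equal_split_ug(600, 2))
    # Thirty isolates at the ends of the 300 to 600 ug total range.
    print(equal_split_ug(300, 30), equal_split_ug(600, 30))
```

With a 600 µg total, two strains receive 300 µg each, and thirty
isolates sharing 300 to 600 µg receive 10 to 20 µg each, which
lies within the stated range of about 10 to about 50 µg.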

The vaccines of this invention are administered in
the same manner as prior art HIV gp120 subunit
vaccines. In particular, the vaccines are generally
administered at 0, 1, and at 6, 8 or 12 months,
depending on the protocol. A preferred protocol
includes administration at 0, 1, 6, and 12 months.
Following the immunization procedure, annual or
bi-annual boosts can be administered. However, during
the immunization process and thereafter, neutralizing
antibody levels can be assayed and the protocol
adjusted accordingly.

The vaccine is administered to uninfected
individuals. In addition, the vaccine can be
administered to seropositive individuals to augment
immune response to the virus, as with prior art HIV
vaccines. It is also contemplated that DNA encoding
the strains of gp120 for the vaccine can be
administered in a suitable vehicle for expression in
the host. In this way, gp120 can be produced in the
infected host, eliminating the need for repeated
immunizations. Preparation of gp120 expression
vehicles is described hereinafter.

Although the gp120 isolates described herein can
be used as a vaccine as described above, the amino acid
sequences can also be used alone or in combinations in
the same type of formulation for use as an immunogen,
to induce antibodies that recognize the isolate(s)
present in the immunogen. Immunogens are formulated in
the same manner as vaccines and can include the same
excipients. Antibodies induced by the immunogens
can be used in a diagnostic to detect the HIV strain in
the immunogen or to affinity purify the strain.

gp120 Polypeptide Sequences and Chemokine Receptors

While CD4 is the primary cellular receptor for
HIV-1, it is not sufficient for entry of HIV-1 into
cells. Co-receptors required in conjunction with CD4
have been identified. These co-receptors are members
of the chemokine receptor family of seven-transmembrane
G-protein coupled receptors. The chemokine superfamily
is subdivided into two groups based on the amino
terminal cysteine spacing. The CXC chemokines are
primarily involved in neutrophil-mediated inflammation,
and the CC chemokines tend to be involved in chronic
inflammation. At least five CC chemokine receptors,
designated CC-CKR1-5 (also known in the art as CCR1-5),
and at least four CXC chemokine receptors, designated
CXC-CKR1-4 (also known as CXCR-1-4), have been
identified.

CXC-CKR-4 (CXCR-4), which has also been called the
alpha-chemokine receptor fusin, serves as an entry
cofactor for T-cell-tropic HIV-1 strains. CC-CKR-5
(CC-R5), which has been called beta-chemokine receptor,
together with its related family members, such as
CC-CKR-2b and CC-CKR3, serve as entry cofactors for
macrophage-tropic HIV-1 strains. T-cell-tropic strains
can infect primary T-cells and T-cell lines, but not
macrophages, whereas macrophage-tropic strains can
infect macrophages and primary T-cells, but not T-cell
lines. T-cell- and macrophage-tropic strains are
discussed more fully in Deng et al., Nature 381:661-
666 (1996), which is hereby incorporated by reference
in its entirety. Examples of T-cell-tropic strains
include laboratory isolates, such as IIIB and MN.
Macrophage-tropic strains include primary isolates,
including but not limited to A244, GNE6, GNE8, and
breakthrough viruses from vaccinees immunized with
gp120-based vaccines. Dual-tropic strains can use both
types of co-receptors, entering cells via CXC-CKR-4 or
via one or more CC-CKR family members, preferably
CC-CKR-5, CC-CKR-2b, or CC-CKR-3. While the present
invention is not intended to be bound or limited by any
one theory, the entry of T-cell tropic and
macrophage-tropic HIV-1 strains is believed to provide
a unifying explanation of the differences in cell
tropism between viral strains, the resistance to HIV-1
infection by many CD4-transfected nonprimate cells, and
the HIV-1-infection resistance of a portion of the
human population.

Accordingly, in one embodiment is a vaccine
containing: (1) a first gp120 polypeptide sequence, or
fragment thereof, from a macrophage-tropic HIV-1 strain
and/or a second gp120 polypeptide sequence, or fragment
thereof, from a T-cell tropic strain, in combination
with (2) a breakthrough isolate HIV gp120 polypeptide
sequence, or fragment thereof, from a vaccinee
vaccinated with the first and/or second HIV gp120
polypeptide sequence. Preferably, the vaccine includes
at least two gp120 polypeptide sequences that bind to
different chemokine receptors. In one embodiment, the
vaccine includes first and second gp120 polypeptide
sequences that bind to different chemokine receptors.
In addition, the breakthrough isolate gp120 polypeptide
sequence can bind to a different chemokine receptor
than the chemokine receptor(s) bound by either or both
of the first and second gp120 polypeptide sequence(s).
A preferred T-cell tropic strain is a laboratory
isolate, most preferably MN. Preferred
macrophage-tropic viruses for use in the invention are
GNE6 and GNE8, which are representative of the
breakthrough viruses disclosed herein and differ from
MN in that their gp120s induce the formation of
antibodies that recognize gp120 sequences (e.g., the V3
domain) involved in binding to CC chemokine receptors,
such as CC-CKR-5.

In one embodiment, HIV infection is prevented by
administering one or more chemokine receptor-binding
gp120 polypeptide sequences, or fragment(s) thereof
containing appropriate chemokine receptor-binding
domains, in a vaccine, such as those described above.
Preferably, the vaccine also includes one or more
CD4-binding gp120 polypeptide sequences or appropriate
fragments thereof. Such vaccines induce anti-HIV
antibodies that inhibit viral gp120-chemokine receptor
or -CD4 binding. In addition, such gp120 polypeptides
can directly inhibit HIV infection by binding to one or
more co-receptors for HIV infection, such as CD4 or a
chemokine receptor, thus providing a prophylactic or
therapeutic effect in treating HIV infection.
Preferably, gp120 polypeptide sequences useful in this
regard contain the T-cell binding (TCB) domain.

Various uses of chemokine receptor-binding gp120
polypeptides are discussed below with regard to the CC
chemokine receptor family. However, those skilled in
the art recognize that this discussion applies equally
to CXC chemokine receptors that act as cofactors in HIV
infection.

The gp120 polypeptides can be used as a
composition containing one or more gp120 polypeptides,
as described for use as a vaccine or immunogen. The
composition can be administered, prophylactically or
therapeutically, to a patient at risk of infection or
in need of such treatment using the dosages and routes
and means of administration described herein. However,
chronic administration may be preferred and dosages can
be adjusted accordingly. It is noted that in vivo
administration can also induce antibodies that bind
viral gp120, further inhibiting virus binding to
CC-CKR.

The gp120 polypeptides can also be used in
screening assays to identify antagonists of CC-CKR.
For example, candidate antagonists can be screened for
inhibition of binding of gp120 to a CC-CKR
receptor that is isolated and attached to a surface
(e.g., plastic dish) or recombinantly or naturally
expressed on the surface of a cell. Antagonists can
either bind gp120 or bind receptor. Preferred
candidate antagonists include gp120 compounds, small
gp120 peptides (5 to 20 amino acids in length,
preferably 7 to 10 amino acids in length) or
peptidomimetics of gp120 that bind receptor, monoclonal
antibodies that bind gp120, and small organic molecules
that bind either gp120 or receptor.

The antibodies induced by the gp120 polypeptides
can also be used to induce anti-idiotype antibodies
that bind CC chemokines. These anti-idiotype
antibodies can be screened for binding to an anti-gp120
polypeptide antibody and inhibiting gp120 from binding
CC-CKR receptor. Such anti-idiotype antibodies mimic
gp120 by binding to CC-CKR receptor. Such antibodies,
preferably human antibodies, can be obtained in a
number of ways, such as human antibodies from
combinatorial libraries (e.g., Burton et al., Adv.
Immunol. (1994) 57:191-280). It is now possible to
produce transgenic animals (e.g., mice) that are
capable, upon immunization, of producing a full
repertoire of human antibodies in the absence of
endogenous immunoglobulin production. For example,
homozygous deletion of the antibody heavy-chain joining
region (JH) gene in chimeric and germ-line mutant mice
results in complete inhibition of endogenous antibody
production. Transfer of the human germ-line
immunoglobulin gene array in such germ-line mutant mice
results in the production of human antibodies upon
antigen challenge as described in Jakobovits et al.,
Proc. Natl. Acad. Sci. USA 90:2551 (1993); Jakobovits
et al., Nature 362:255-258 (1993); Bruggermann et al.,
Year in Immuno. 7:33 (1993).

Alternatively, phage display technology as
described by McCafferty et al., Nature 348:552-553
(1990) can be used to produce human antibodies and
antibody fragments in vitro from immunoglobulin
variable (V) domain gene repertoires from unimmunized
donors. According to this technique, antibody V domain
genes are cloned in-frame into either a major or
minor coat protein gene of a filamentous bacteriophage,
such as M13 or fd, and displayed as functional antibody
fragments on the surface of the phage particle.
Because the filamentous particle contains a
single-stranded DNA copy of the phage genome, selections
based on the functional properties of the antibody also
result in selection of the gene encoding the antibody
exhibiting those properties. Phage display can be
performed in a variety of formats as reviewed by, for
example, Johnson et al., Current Opinion in Structural
Biology 3:564-571 (1993).

Several sources of V-gene segments can be used for
phage display. Clackson et al., Nature 352:624-628
(1991) isolated a diverse array of anti-oxazolone
antibodies from a small random combinatorial library of
V genes derived from the spleens of immunized mice. A
repertoire of V genes from unimmunized human donors (or
embryonic cells) can be constructed. It has been
demonstrated that antibodies to a diverse array of
antigens (including self-antigens) can be isolated
essentially following the techniques described by Marks
et al., J. Mol. Biol. 222:581-597 (1991), or Griffith
et al., EMBO J. 12:725-734 (1993).

In a natural immune response, antibody genes
accumulate mutations at a high rate (somatic
hypermutation). Some of the changes introduced confer
higher affinity, and B cells displaying high-affinity
surface immunoglobulin are preferentially replicated
and differentiated during subsequent antigen challenge.
This natural process can be mimicked by employing the
technique known as "chain shuffling" (Marks et al.,
Bio/Technol. 10:779-783 [1992]). In this method, the
affinity of "primary" human antibodies obtained by
phage display can be improved by sequentially replacing
the heavy and light chain V region genes with
repertoires of naturally occurring variants
(repertoires) of V domain genes obtained from
unimmunized donors. This technique allows the
production of antibodies and antibody fragments with
affinities in the nM range. A strategy for making very
large phage antibody repertoires has been described by
Waterhouse et al., Nucl. Acids Res. 21:2265-2266
(1993).

Accordingly, antibodies that bind CC-CKR can be
obtained by screening antibodies or fragments thereof
expressed on the surface of bacteriophage in
combinatorial libraries or in other systems as
described above with a gp120 monoclonal antibody that
inhibits gp120 binding to receptor.

In addition to screening antibodies with a gp120
antibody, random or combinatorial peptide libraries can
be screened with either a gp120 antibody or the gp120
compounds of the invention. Approaches are available
for identifying peptide ligands from libraries that
comprise large collections of peptides, ranging from 1
million to 1 billion different sequences, which can be
screened using monoclonal antibodies or target
molecules. The power of this technology stems from the
chemical diversity of the amino acids coupled with the
large number of sequences in a library. See, for
example, Scott et al., Curr. Opin. Biotechnol. 5(1):40-8
(1994); Kenan et al., Trends Biochem. Sci. (1994)
19(2):57-64. Accordingly, the monoclonal antibodies,
preferably human monoclonals or fragments thereof,
generated as discussed herein, find use in treatment by
inhibiting or treating HIV infection or disease
progression, as well as in screening assays to identify
additional pharmaceuticals.


Production of gp120

gp120 for a vaccine can be produced by any
suitable means, as with prior art HIV gp120 subunit
vaccines. Recombinantly-produced or chemically
synthesized gp120 is preferable to gp120 isolated
directly from HIV for safety reasons. Methods for
recombinant production of gp120 are described below.

Oligonucleotides encoding gp120 from breakthrough
isolates and capable of expressing gp120 can be
prepared by conventional means. For example, the
nucleotide sequence can be synthesized. Alternatively,
another HIV nucleotide sequence encoding gp120 can be
used as a backbone and altered at any differing
residues as by site-directed mutagenesis.
Site-directed mutagenesis is described in Kunkel et al.,
Proc. Natl. Acad. Sci. 82:488-492 (1985) and
Zoller et al., Nuc. Acids Res. 10:6487-6500 (1982) and
is well known.

In a preferred embodiment, the nucleotide sequence
is present in an expression construct containing DNA
encoding gp120 under the transcriptional and
translational control of a promoter for expression of
the encoded protein. The promoter can be a eukaryotic
promoter for expression in a mammalian cell. In cases
where one wishes to expand the promoter or produce
gp120 in a prokaryotic host, the promoter can be a
prokaryotic promoter. Usually a strong promoter is
employed to provide high-level transcription and
expression.

The expression construct can be part of a vector
capable of stable extrachromosomal maintenance in an
appropriate cellular host or may be integrated into
host genomes. Normally, markers are provided with the
expression construct which allow for selection of a
host containing the construct. The marker can be on
the same or a different DNA molecule, desirably, the
same DNA molecule.

The expression construct can be joined to a
replication system recognized by the intended host
cell. Various replication systems include viral
replication systems such as those from retroviruses,
simian virus, bovine papilloma virus, or the like. In
addition, the construct may be joined to an amplifiable
gene, e.g. the DHFR gene, so that multiple copies of
the gp120 DNA can be made. Introduction of the
construct into the host will vary depending on the
construct and can be achieved by any convenient means.
A wide variety of prokaryotic and eukaryotic hosts can
be employed for expression of the proteins.

Preferably, the gp120 is expressed in mammalian
cells that provide the same glycosylation and disulfide
bonds as in native gp120. Expression of gp120 and
fragments of gp120 in mammalian cells as fusion
proteins incorporating N-terminal sequences of Herpes
Simplex Virus Type 1 (HSV-1) glycoprotein D (gD-1) is
described in Lasky, L. A. et al., 1986 (Neutralization
of the AIDS retrovirus by antibodies to a recombinant
envelope glycoprotein) Science 233:209-212 and Haffar,
O.K. et al., 1991 (The cytoplasmic tail of HIV-1 gp160
contains regions that associate with cellular
membranes.) Virol. 180:439-441, respectively. A
preferred method for expressing gp120 is described in
the examples. In the examples, a heterologous signal
sequence was used for convenient expression of the
protein. However, the protein can also be expressed
using the native signal sequence.

An isolated, purified gp120 polypeptide having one
of the amino acid sequences illustrated in Table 1 can
be produced by conventional methods. For example, the
proteins can be chemically synthesized. In a preferred
embodiment, the proteins are expressed in mammalian
cells using an expression construct of this invention.
The expressed proteins can be purified by conventional
means. A preferred purification procedure is described
in the examples.

gp120 Fragments

The present invention also provides gp120
fragments that are suitable for use in inducing
antibodies for use in a vaccine formulation. A
truncated gp120 sequence, as used herein, is a fragment
of gp120 that is free from a portion of the intact
gp120 sequence beginning at either the amino or carboxy
terminus of gp120. A truncated gp120 sequence of this
invention is free from the C5 domain. The C5 domain of
gp120 is a major immunogenic site of the molecule.
However, antibodies to the region do not neutralize
virus. Therefore, elimination of this portion of gp120
from immunogens used to induce antibodies for
serotyping is advantageous.

In another embodiment, the truncated gp120
sequence is additionally free from the carboxy terminal
region through about amino acid residue 453 of the
gp120 V5 domain. The portion of the V5 domain
remaining in the sequence provides a convenient
restriction site for preparation of expression
constructs. However, a truncated gp120 sequence that
is free from the entire gp120 V5 domain is also
suitable for use in inducing antibodies.

In addition, portions of the amino terminus of
gp120 can also be eliminated from the truncated gp120
sequence. In particular, the truncated gp120 sequence
can be free from the gp120 signal sequence. The
truncated gp120 sequence can be free from the amino
terminus through amino acid residue 111 of the gp120 C1
domain, eliminating most of the C1 domain but
preserving a convenient restriction site. However, the
portion of the C1 domain through the V2 cysteine
residue that forms a disulfide bond can additionally be
removed, so that the truncated gp120 sequence is free
from the amino terminus through amino acid residue
117 of the gp120 C1 domain. In a preferred embodiment,
the truncated gp120 sequence is free from the amino
terminus of gp120 through residue 111 of the C1 domain
and residue 453 through the carboxy terminus of gp120.
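
The preferred truncation can be sketched as a slice over a
residue string. The Python fragment below is an illustration
only: the function name truncate_gp120 and its parameters are
hypothetical, and it assumes that the input string is numbered
so that its first character is residue 1 of the reference
numbering used in the text. With the default limits, residues
112 through 452 are retained.

```python
def truncate_gp120(residues, n_removed_through=111, c_removed_from=453):
    """Drop residues 1..n_removed_through and c_removed_from..end (1-based).

    The retained fragment is residues (n_removed_through + 1) through
    (c_removed_from - 1), inclusive.
    """
    last_kept = c_removed_from - 1
    if not 0 <= n_removed_through < last_kept <= len(residues):
        raise ValueError("truncation limits do not fit the sequence")
    # Python slices are 0-based and exclude the end index.
    return residues[n_removed_through:last_kept]


if __name__ == "__main__":
    # Placeholder string standing in for a full-length sequence;
    # it is not a real gp120 sequence.
    alphabet = "ACDEFGHIKLMNPQRSTVWY"
    placeholder = "".join(alphabet[i % 20] for i in range(478))
    fragment = truncate_gp120(placeholder)
    print(len(fragment))
    # Variant that also removes the C1 region through residue 117.
    print(len(truncate_gp120(placeholder, n_removed_through=117)))
```

Passing n_removed_through=117 gives the variant in which the C1
region through the V2 cysteine residue is also removed.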

The truncated gp120 sequences can be produced by
recombinant engineering, as described previously.
Conveniently, DNA encoding the truncated gp120 sequence
is joined to a heterologous DNA sequence encoding a
signal sequence.

It is understood that the application of the
teachings of the present invention to a specific
problem or situation is within the capabilities of one
having ordinary skill in the art in light of the
teachings contained herein. Examples of the products
of the present invention and representative processes
for their isolation, use, and manufacture appear below,
but should not be construed to limit the invention.

All literature citations herein are expressly 
incorporated by reference. 
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Specimen collection from human w Blood 
' V: : Was : collected f rb^ who 

'''K^^-^f 0 ^' with Hiy-r w^ 

: :; (NIH Protocol AVEG 016 ) - and Phased I 
3: MEG: 201) HIV-1 vaccine trials sponsored ^ 

National Institutes of Health (NIK) . The demographics 
of the subjects in the study, and the study design have 
, V: : been described in McElrath; Seminars in Cancer Biol. 
6: l-ii (19.95); McElrath et aL ; Abstracts Zrom Eighth 
Annual Meeting of the National Cooperative Vaccine 
development Groups for AIDS . Bethseda, MD 216 (1996) . - 
Specimens were obtained according to ah informed 
consent protocol approved by the institutional review 
boards of the participating institutions. in the: 
15 experimental section, the time of HIV-i infect ion is 

specified with regard to data provided by the NIH AIDS 
Vaccine Evaluation Network where PCR : ( FtNA) ;.' and/ or 
/serologic assays were used to detect Hiv-I infection. 

Sample preparation for cloning HIV-1 envelope
glycoproteins. Peripheral blood mononuclear cells
(PBMCs) from HIV-1 infected vaccinees were prepared
from heparinized venous blood by FICOLL-HYPAQUE
gradient centrifugation. Cell number and viability
were determined. After separation, PBMCs were washed
twice in phosphate-buffered saline and suspended at a
cell density of 6x10^6 cells/ml in PCR lysis buffer
(50 mM KCl, 10 mM Tris (pH 8.4), 2.5 mM MgCl2, 0.1 mg/ml
gelatin (Sigma), 0.45% NONIDET P40 detergent,
0.45% TWEEN 20 detergent (both detergents are
commercially available from United States Biochemical
Corp.) and 0.06 mg/ml Proteinase K (Gibco BRL)) to lyse
the cells. The lysate was incubated at 50-60°C for
1 hour, followed by inactivation of the Proteinase K at
95°C for 10 minutes. Lysates were shipped frozen and
stored at -70°C until use.

Polymerase chain reaction (PCR) amplification.
Samples were subjected to two rounds of PCR
amplification using the nested primers described below.
In the first round, 25 µl aliquots of PBMC lysates
(containing about 1 µg genomic DNA) were mixed with an
equal volume of a PCR reaction mix containing 400 µM
each dNTP, 200 µg/ml BSA (Sigma Chemical Corporation,
RIA grade) and about 100 pmoles of each primer in 50 mM
KCl, 20 mM Tris (pH 8.4) and 3 mM MgCl2. After an
initial 10 minute denaturation step at 95°C, 5 units of
Taq polymerase (AMPLITAQ, Perkin Elmer Cetus) were
added during a 55°C soak step, and samples were
overlaid with mineral oil.

The PCR profile was as follows: 2 cycles having
1 minute at 55°C, 2.5 minutes at 72°C and 1 minute at
94°C, followed by 28 cycles with 30 seconds at 55°C,
2.5 minutes at 72°C and 45 seconds at 94°C, and an
extension step at 72°C for 5 minutes.
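
The cycling profile just described can be written down as data, which
makes the programmed hold time easy to check. The following Python
sketch is illustrative only and is not part of the described method.
The names Step, Stage, PROFILE, hold_seconds and total_cycles are
hypothetical, and the total deliberately excludes the initial 10 minute
denaturation, the 55°C soak and all ramp times, none of which can be
derived from the profile as stated.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: float    # hold temperature in degrees Celsius
    seconds: float   # hold time in seconds


@dataclass(frozen=True)
class Stage:
    cycles: int      # number of times the steps are repeated
    steps: tuple     # ordered Step objects for one cycle


# Profile as stated in the text: 2 long cycles, 28 short cycles,
# then a single final extension at 72 C.
PROFILE = (
    Stage(2, (Step(55, 60), Step(72, 150), Step(94, 60))),
    Stage(28, (Step(55, 30), Step(72, 150), Step(94, 45))),
    Stage(1, (Step(72, 300),)),
)


def hold_seconds(profile):
    """Sum of programmed hold times, ignoring ramping between steps."""
    return sum(
        stage.cycles * sum(step.seconds for step in stage.steps)
        for stage in profile
    )


def total_cycles(profile):
    """Count amplification cycles (stages with more than one step)."""
    return sum(stage.cycles for stage in profile if len(stage.steps) > 1)


if __name__ == "__main__":
    print(total_cycles(PROFILE), "cycles")
    print(hold_seconds(PROFILE) / 60, "minutes of programmed holds")
```

Under these assumptions the profile amounts to 30 amplification cycles
and 119 minutes of programmed holds per round.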

Aliquots of 10 µl from the first-round reactions
were re-amplified with appropriate nested primers, in a
final reaction volume of 100 µl, using either the
reagents and profile described above or the reagents
and profile described in the PCR Optimizer Kit
(Invitrogen). PCR reaction products were purified
using columns from Qiagen Inc. The primer
pair used in the first round was either 120.os.F
(5'-gggaattcggatccAGAGCAGAAGACAGTGGCAATGA, with
homologous sequence at position 6248-6270 of HIVPV22)
(SEQ. ID. NO. 34) or JM11A
(5'-ctcgagCTCCTGAAGACAGTCAGACTCATCAAG at position
6048-6074) (SEQ. ID. NO. 35) in the forward direction
[Kusumi et al., J. Virol. 66:875 (1992)] combined with
120.os.R (5'-ggtctagaagctttaGCCCATAGTGCTTCCTGCTGCTCC
at position 7836-7859) (SEQ. ID. NO. 36) in the reverse
direction. The internal nested primers were 120.BX.F
(5'-gggcggatcctcgaGGTACCTGTRTGGAAAGAAGCA at position
6389-6410; R: A or G) (SEQ. ID. NO. 37) and 120.is.R
(5'-gtctagaagcttt[...] at position
7819-7840; Y: T or C) (SEQ. ID. NO. 38). Heterologous
primer sequences are shown in lower case letters.
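
The primer notation used above (lower case for heterologous tails,
upper case for the HIV-homologous part, R for A or G, Y for T or C) can
be unpacked mechanically. The Python sketch below is an illustration
under stated assumptions: the function names are hypothetical, only the
ambiguity codes mentioned in the text are included, and the 120.BX.F
sequence is used as transcribed from the primer listing.

```python
from itertools import product

# Only the ambiguity codes named in the text are included here.
AMBIGUITY = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG",  # R: A or G
    "Y": "CT",  # Y: T or C
}


def homologous_part(primer):
    """Return the upper-case (template-homologous) part of a primer."""
    return "".join(base for base in primer if base.isupper())


def expand_degenerate(sequence):
    """List every concrete sequence encoded by a degenerate sequence."""
    pools = [AMBIGUITY[base] for base in sequence.upper()]
    return ["".join(choice) for choice in product(*pools)]


if __name__ == "__main__":
    bx_f = "gggcggatcctcgaGGTACCTGTRTGGAAAGAAGCA"  # 120.BX.F as transcribed
    core = homologous_part(bx_f)
    # The stated position range 6389-6410 spans 22 nucleotides.
    print(core, len(core))
    for variant in expand_degenerate(core):
        print(variant)
```

For 120.BX.F the homologous part is 22 nucleotides long, which matches
the stated position range 6389-6410, and the single R expands to two
concrete primer sequences.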

Subcloning of PCR products and the expression of
recombinant envelope glycoproteins as fusion proteins.
The HIV-1 envelope glycoprotein gp120 sequences were
cloned and expressed as chimeric genes and fusion
proteins, where the signal sequence and 27 amino acids
from the mature N terminus of herpes simplex virus
type 1 (HSV-1) were fused to the N-terminal sequences
of the gp120 genes, corresponding to amino acid 13 of
the mature gp120 sequence. PCR products containing
gp120 sequences from the breakthrough specimens were
cloned into pRK5 expression plasmid as chimeric genes
using combinations of restriction sites engineered
into the heterologous PCR primer tails and the Xho I
site engineered into the N-terminal sequence of
HSV-1 gD.

The resulting double-stranded DNA was sequenced
with Sequenase and the dGTP Reagent Kit (United States
Biochemical Corp.). Sequences from glycoprotein D were
provided to enhance expression and to provide a flag
epitope to facilitate protein analysis, as described in
Berman et al., J. Virol. 7:4464-9 (1992); Nakamura et
al., AIDS and Human Retroviruses 8:1875-85 (1992); and
Nakamura et al., J. Virol. 67:6179-91 (1993).

Briefly, isolated DNA fragments generated by the
PCR reaction were ligated into a plasmid (pRK.gD-5,
pRKgDstop) designed to fuse the gp120 fragments, in
frame, to the 5' sequences of the glycoprotein D (gD)
gene of Type 1 Herpes Simplex Virus (gD-1) and the 3'
end to translational stop codons. The fragment of the
gD-1 gene encoded the signal sequence and 25 amino
acids of the mature form of HSV-1 protein. To allow
for expression in mammalian cells, chimeric gene
fragments were cloned into the pRK5 expression plasmid
(Eaton et al., Biochemistry 291:8343-8347 (1986)) that
contained a polylinker with cloning sites and
translational stop codons located between a
cytomegalovirus promoter and a simian virus 40 virus
polyadenylation site.

The resulting plasmids were transfected into the
293s embryonic human kidney cell line (Graham et al.,
J. Gen. Virol. 36:59-77 (1977)) using a calcium
phosphate technique (Graham et al., Virology 52:456-467
(1973)). Growth conditioned cell culture media was
collected 48 hr after transfection, and the soluble
proteins were detected by ELISA or by specific
radioimmunoprecipitation where metabolically labeled
proteins from cell culture supernatants were resolved
by sodium dodecyl sulfate polyacrylamide gel
electrophoresis (PAGE) and visualized by
autoradiography as described in Berman et al.,
J. Virol. 63:3489-3498 (1989) and Laemmli, Nature
227:680-685 (1970).

WO 98/01564
PCT/US97/09690

Serologic assays. Sera were assayed for
antibodies to rgp120, antibodies to synthetic gp120 V3
domain peptides corresponding to sequences from the
gp120 V3 domain, and antibodies able to inhibit the
binding of MN-rgp120 to cell surface CD4 using
serologic assays described in Berman et al., J. Virol.
7:4464-9 (1992); Nakamura et al., AIDS and
Human Retroviruses 8:1875-85 (1992); and Nakamura et
al., J. Virol. 67:6179-91 (1993). Endpoint titers of
antibody binding to gp120 and V3 peptides were
determined using three-fold serial dilutions of sera.
The endpoint dilution titer was defined as the last
dilution that produced an optical density value that
was two times higher than the mean of the optical
densities of 1:50 diluted, pooled, normal human sera.
Antibody titers were calculated by a computer program
that interpolated between dilutions. The inter-assay
coefficient of variation of positive control standard
sera was 35%.
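
The endpoint definition given above (the last dilution in a three-fold
series whose optical density is at least twice the mean of the 1:50
normal human serum controls) can be sketched in a few lines of Python.
This is a minimal illustration, not the computer program referred to in
the text: the function name, the starting dilution of 1:50 and the
example optical densities are all hypothetical, and no interpolation
between dilutions is attempted.

```python
def endpoint_titer(reciprocal_dilutions, optical_densities, control_ods, fold=2.0):
    """Return the reciprocal of the last dilution at or above the cutoff.

    The cutoff is `fold` times the mean OD of the negative control wells.
    Dilutions are scanned from most to least concentrated and scanning
    stops at the first dilution that falls below the cutoff. Returns None
    when even the most concentrated dilution is below the cutoff.
    """
    cutoff = fold * sum(control_ods) / len(control_ods)
    titer = None
    for dilution, od in sorted(zip(reciprocal_dilutions, optical_densities)):
        if od >= cutoff:
            titer = dilution
        else:
            break
    return titer


if __name__ == "__main__":
    # Hypothetical three-fold series starting at 1:50.
    dilutions = [50 * 3 ** i for i in range(6)]
    ods = [1.80, 1.20, 0.70, 0.35, 0.15, 0.08]      # made-up readings
    controls = [0.09, 0.10, 0.11]                   # made-up control wells
    print(dilutions)
    print(endpoint_titer(dilutions, ods, controls))
```

With these made-up readings the cutoff is about 0.2 and the last
dilution above it is 1:1350, which would be reported as the endpoint
titer.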

Binding of monoclonal antibodies to rgp120 from
breakthrough viruses. An ELISA similar to that
described by Moore et al., AIDS 3:155-63 (1989) was
used to measure the binding of various monoclonal
antibodies (MAbs) to rgp120s from breakthrough viruses.
Briefly, Nunc-Immuno plates (Maxisorp, certified) were
coated (100 µl at 5 ng/ml in PBS at 4°C overnight) with
an affinity-purified sheep polyclonal antiserum to a
peptide at the C terminus of gp120 (D7324,
International Enzymes, Fallbrook, CA). After washing
once with PBS-0.05% TWEEN-20 detergent, the plates were
blocked with PBS-1.0% BSA for 30-60 minutes at room
temperature. Cell culture supernatants from 293s
cells, diluted to contain equivalent amounts of the
gD-rgp120 fusion protein, were added and incubated for
2 hours at room temperature followed by three washes
with PBS-0.05% TWEEN-20 detergent. Various MAbs were
diluted in PBS-1.0% BSA and 100 µl of the diluted MAbs
were added to each well and incubated for 1 hour at
room temperature.

The plates were washed 3 times and incubated with
100 µl of a horseradish peroxidase-conjugated second
antibody (goat anti-mouse or anti-human IgG, Cappel)
for 1 hour at room temperature. After 3 washes the
plates were developed and the OD492 (optical density at
492 nm) read in a plate reader. Growth conditioned
cell culture supernatants were normalized by dilution
based on binding by MAb 5B6, which is specific for HSV-1
glycoprotein D fusion protein.

Virus neutralization assays. The ability of
vaccinee sera to inhibit infection of MT4 cells by
HIV-1 MN was measured in an assay where cell
viability was quantitated using a colorimetric
indicator dye, as described in Robertson et al.,
J. Virol. Methods 20:195-202 (1988). Briefly, a virus
stock of HIV-1 MN (obtained from Dr. Michael Norcross,
U.S. Food and Drug Administration) was prepared as the
clarified supernatant from chronically infected
H9/HIV-1 MN cell culture. H9 cells chronically infected
with HIV-MN were pelleted and resuspended in one-tenth
the original volume of medium. Cell-associated virus
was released by the mechanical shearing effects of
rapid vortexing of the cells as described in Wrin et
al., J. Virol. 69:39-48 (1995).

An amount of virus sufficient to ensure complete
cell lysis in 7 days was incubated with
three-fold serial dilutions of test antisera, and then
used to challenge MT4 T-lymphoid cells in 10%
FCS/RPMI-1640 cell culture media. The cultures were
incubated for 7 days at 37°C in 5% CO2, and then cell
viability was tested by the dye MTT, as described by
Robertson et al., J. Virol. Methods 20:195-202 (1988).
Virus neutralization endpoints were quantitated by
measurement of OD at 570-650 nm, and then the endpoint
titers were calculated as the reciprocal of the
antiserum dilution giving a signal that was two-fold
above the control signal with unprotected (killed)
cells. These titers were typically twice those
calculated at 50% protection.

Results

Immunization history of infected subjects. Since
1992, 499 adults have been immunized with MN-rgp120 in
Phase I trials in low or moderate risk individuals and
in a Phase II clinical trial involving moderate to high
risk individuals. The studies described herein entail
the genetic and immunologic characterization of the
first seven of nine individuals who became infected
with HIV-1 through high risk behavior during the course
of these trials. A listing of the trials and summary
of the status of the vaccinees is presented in
Table 2A. A listing of the analysis of the vaccinees
is presented in Table 2B.



                        TABLE 2A
      Description of Vaccinees Infected with HIV-1
           After Immunization with MN-rgp120

Study No.   Case No.   Risk Group*   Antigen dose/Adjuvant†
016         C6         M/H           300/QS21
016         C8         M/H           600/QS21
016         C15        M/H           300/QS21
201         C7         M/H           600/Alum
201         C11        M/H           600/Alum
201         C10        M/IDU         600/Alum
201         C17        M/IDU         600/Alum

* - M/H indicates male homosexual; M/IDU
indicates male intravenous drug user.
† - numbers indicate dose in micrograms of
MN-rgp120 injected per immunization; QS21
indicates antigen was formulated in QS21 adjuvant;
Alum indicates MN-rgp120 formulated in aluminum
hydroxide.

                        TABLE 2B
      Description of Vaccinees Infected with HIV-1

Case   Injection     Injections   Time of    Interval*
No.    Schedule      before       HIV-1+     to HIV-1+
       (months)      HIV-1+       (months)   (months)
C6     0, 1, 10.5    2            [...]      2.00
C8     [...]         2            4.00       3.00
C15    0, 1, 2       3            [...]      4.00
C7     0, 1, 6, 12   3            9.25       3.00
C11    0, 1, 6, 12   [...]        19.50      6.75
C10    0, 1, 6, 19   3            19.50      13.50
C17    0, 1, 6, 18   [...]        [...]      6.25

* - indicates interval between last immunization and
detection of HIV-1 infection.





Three of the infections occurred in a Phase I
trial (NIH Protocol AVEG 201) that compared the safety
and immunogenicity of MN-rgp120 formulated in two
different adjuvants (alum and QS21), and four of the
infections occurred in a Phase II trial aimed at
establishing the safety of MN-rgp120
in various high risk groups (e.g., intravenous drug
users, homosexual and bisexual males, and partners of
HIV-1 infected individuals).

Of the seven infections studied (Table 3), two (C6
and C8) occurred after two injections, three (C7, C10,
and C15) occurred after three injections, and two (C11
and C17) occurred after receiving scheduled
injections. The interval between receiving the last
immunization and becoming infected was 2 to
13.5 months.

                        TABLE 3
       Peak Post Boost MN-rgp120 Antibody Titers
     in Vaccinees that Became Infected with HIV-1

Injections   C6      C8      C15    C7     C11     C10     C17
1            <50     2185    79     <50    1890    na      na
2            21539   10125   na     413    32696   7771    7056
3            #       #       4460   9707   34728   11627   1841
4            #       [...]   #      #      [...]   #       1134

# - indicates specimen not analyzed because of
HIV-1 infection.
na - indicates the sample was not available.
boldface - indicates unusually low antibody
titers.



20 



Antibody response to gp120 in vaccinated
individuals. The magnitude and specificity of the
antibody response to MN-rgp120 was measured by ELISA in
all infected individuals throughout the course of the
immunization regime (Figure 1). Five of the seven
subjects exhibited normal antibody response kinetics
that included a small but reproducible primary response
(1:100-1:2,000) and a strong secondary (booster)
response (titers ranging from 1:7,000-1:32,000), and
antibody responses following third and fourth
injections that were similar or marginally higher than
those achieved after the second immunization (Figure 1,
Table 3).

The antibody response observed in C7 (Figure 1C)
was unusual in that no antibodies were detectable after
the primary injection and a titer of only 1:350 was
detected after the second injection. It thus appeared
that C7 did not respond to the primary immunization,
and that the antibody response obtained after the
second injection represented a primary immune response.
Consistent with this hypothesis, the third injection
elicited a titer of only 1:9,707, typical of those
normally seen after two immunizations.

An atypical antibody response was also seen in
subject C15 (Figure 1G) who was immunized according to
an accelerated immunization schedule of 0, 1, and 2
months. As expected, the antibody titer seen in this
subject (1:4,460) was at the low end of what is
typically achieved after two immunizations, and was far
below normal values for three immunizations. The lack
of an effective booster response after the third
immunization of C15 was not surprising in view of
previous studies where an accelerated 0, 1, and 2 month
immunization schedule in baboons [Anderson et al.,
J. Infect. Dis. 160:960-9 (1989)] similarly prolonged
the secondary response and failed to elicit an
effective tertiary booster response.
Retrospective analysis of serum and plasma from
subjects C6 (Figure 1A) and C8 (Figure 1B) indicated
that they became infected with HIV-1 at some point
between the second and third immunizations. Serologic
evidence of HIV-1 infection was evident in the gp120
antibody assays where the titers failed to decline two
weeks after the second injection and instead formed an
uncharacteristic high titer plateau (Figures 1A and
1B). A similar plateau in MN-rgp120 titer after the
third injection suggested that subject C7 became
infected around week 36, approximately 16 weeks after
receiving the third injection (Figure 1C). Subjects
C10 (Figure 1E), C11 (Figure 1D), C15 (Figure 1G), and
C17 (Figure 1F) developed unexpected increases in gp120
titers, typical of HIV-1 infection, after either the
third or fourth immunizations. The data obtained
demonstrate that immunologic priming for MN-rgp120
antibody responses is insufficient to provide universal
protection from HIV-1 infection.

Antibody titers to the V3 domain. To further
characterize the antibody response to gp120, antibody
titers were measured to a synthetic V3 domain peptide
of MN-rgp120 containing the principal neutralizing
determinant (PND). Five of the seven subjects
developed good V3 titers (1:400 to 1:4,000) after the
second immunization; however, two subjects (C7 and C15)
required three immunizations before developing
significant titers (Figures 1C and 1G). As had been
observed previously (11), the peak V3 titers in some
individuals (e.g., C11, C10, C17) appeared to decline
with each successive immunization (Figures 1D, 1E, and
1F). After HIV-1 infection, two patterns of V3
reactivity were observed. Three subjects (C6, C7, and
C10) showed large increases in titer to V3 domain
peptides (Figures 1A, 1C, and 1E) whereas C8
(Figure 1B) showed a large decrease in V3 titer. At
the time of analysis, the data were insufficient to
draw any conclusions regarding the changes in V3 titers
in response to HIV-1 infection in subjects C11, C15, and
C17.

The results obtained indicate that the ability to
form antibodies reactive with the V3 domain at various
time points prior to HIV-1 infection is not a valid
correlate of protective immunity against all strains of
HIV-1.

CD4 inhibition titers. Antibodies that block the
binding of gp120 to CD4 represent a heterogeneous class
of virus neutralizing antibodies. Some are known to
bind to the C4 domain of gp120 [Nakamura et al.,
J. Virol. 67:6179-91 (1993); Anderson et al., J.
Infect. Dis. 160:960-9 (1989)], and some are known to
recognize conformation dependent discontinuous epitopes
[Berman et al., J. Virol. 7:4464-9 (1992);
Nakamura et al., J. Virol. 67:6179-91 (1993);
McKeating et al., AIDS Research and Human Retroviruses
8:451-9 (1992); Ho et al., J. Virol. 65:489-93 (1991);
Barbas et al., Proc. Natl. Acad. Sci. USA 91:3809-13
(1994)].

One way to detect antibodies to both types of
epitopes is to measure the ability of vaccinee sera to
prevent the binding of [125I]-labeled gp120 to cell
surface CD4 [Nakamura et al., AIDS and Human
Retroviruses 8:1875-85 (1992); Nakamura et al.,
J. Virol. 67:6179-91 (1993)]. CD4 blocking titers were
detected in all seven of the vaccinees prior to
infection (Figure 2) with peak titers that ranged from
1:10-1:300. At the last time point prior to infection,
the CD4 titers in five of the seven vaccinees were low
(1:30 or less). One vaccinee (C17), however, possessed
a CD4 blocking titer of about 1:300 prior to infection
(Figure 2F). Thus, the lack of antibodies that block
the binding of MN-rgp120 to CD4 cannot account for all
of the infections. Large increases in CD4 blocking
titers (1:100-1:1,000) were seen in five of the seven
subjects after HIV-1 infection. These included
vaccinees C6, C7, [...], C10, and C11. These results
demonstrate that the CD4 blocking titers elicited by
MN-rgp120 were lower than those elicited by natural
infection.



Virus neutralizing activity. The virus
neutralizing activity of antisera from
MN-rgp120-immunized subjects was measured using a
colorimetric assay that measured the viability of MT-4
cells after incubation with antibody treated virus
(HIV-1 MN). Since the actual date of infection was not
known for any of the breakthrough infections, and serum
samples were collected infrequently, the magnitude of
the neutralizing antibody response at the time of
infection is not known for any of the vaccinees.

Of the seven infections examined, the serum sample
closest to the time of infection was that obtained from
C7, where a neutralizing titer of 1:15 to HIV-1 MN was
present three weeks prior to detection of HIV-1
infection (Table 4). In all other cases, however, the
interval between the last injection and the time of
infection was 10 to 25 weeks.






                        TABLE 4
      Neutralization Activity of Sera from Vaccinees
                  Infected with HIV-1

Week   C6      C8      C15     C7      C11     C10     C17
0      <10*    <10*    <10*    <10*    <10*    <10*    <10*
2      <10     <10     <10             -       -
4      <10*    <10*    nd*     <10*    <10*    <10*    <10*
6      10      80      -       <10     30      150     150
8      -               nd*             -       -       -
10     -       -       35      -       -       -       -
15                     -       <10             -
16     150#    250#    -       -       30      10      <10
24                     150#    <10*    20*     <10*    <10*
26                             70      500     200     400
30                             -       -       40      100
33                             15      -       -
35                                     100     -       -
36                             30#     -       10      40
52                                     30*     <10     <10
54                                     250
57                                     100
63                                     90
64                                                     <10
77                                     40#
78                                             500#    10*
80                                                     100
84                                                     60
104                                                    150#

* - indicates immunization.
# - indicates HIV-1 positive.
nd - indicates not done.
- - indicates sample not available.

When sera from the two early infections were
examined (Table 4), one individual (C6) had a peak
neutralizing titer of 1:10 ten weeks prior to detection
of HIV-1 infection, whereas the other individual (C8)
had a neutralizing titer of 1:80 ten weeks prior to
detection of HIV-1 infection. Subject C15, who was
immunized according to an accelerated immunization
schedule, developed a neutralizing titer of 1:35 after
the third injection, 14 weeks prior to HIV-1 infection.
Subject C10, who had a peak neutralizing titer of 1:200
following the third immunization (week 24), had no
detectable titer at week 52, six months prior to the
first indication of HIV-1 infection (week 78).

Subject C11 possessed a neutralizing titer of 1:90
at fourteen weeks prior to detection of HIV-1 and a
peak titer of 1:500 following the third immunization.
Similarly, vaccinee C17 had a neutralizing titer of
1:150 fourteen weeks prior to infection and a peak
titer of 1:400 at two weeks after the third
immunization.

Based on the rate of decay of the gp120 response
of approximately two months [Belshe et al., JAMA
272(6):475-80 (1994)], as well as the observation that
neutralizing titers of 1:150 decayed to 1:10 in 10
weeks in vaccinees C10 and C17, it appears that
neutralizing titers in C8, C15, C11, and C17 could have
declined to 1:10 or less in the intervals between the
last pre-infection serum sample and the time of HIV-1
detection.

The results of these studies demonstrated that all
vaccinees developed some level of virus-neutralizing
antibodies at some time prior to HIV-1 infection, and
that the magnitude of the neutralizing response was
probably low at the time of infection. In general, the
magnitude of the virus-neutralizing response observed
in the individuals that became infected with HIV-1 was
comparable to that seen in non-infected vaccinees as
described in Belshe et al., JAMA 272(6):475-80 (1994).
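
The decay argument made above for the neutralizing titers can be
checked with simple arithmetic. The Python sketch below assumes
first-order (exponential) decay, which the text does not state; the
function names are hypothetical. It derives a half-life from the
observed fall from 1:150 to 1:10 in 10 weeks and then asks how long a
titer of 1:80 would take to reach 1:10 at the same rate.

```python
import math


def half_life_weeks(start_titer, end_titer, elapsed_weeks):
    """Half-life implied by a fall from start_titer to end_titer,
    assuming exponential decay (an assumption of this sketch)."""
    return elapsed_weeks * math.log(2) / math.log(start_titer / end_titer)


def weeks_to_reach(start_titer, target_titer, half_life):
    """Weeks for a titer to decay from start_titer to target_titer."""
    return half_life * math.log2(start_titer / target_titer)


if __name__ == "__main__":
    # Observation from the text: 1:150 decayed to 1:10 in 10 weeks.
    hl = half_life_weeks(150, 10, 10)
    print(round(hl, 2), "weeks half-life")
    # Time for a 1:80 titer to fall to 1:10 at the same rate.
    print(round(weeks_to_reach(80, 10, hl), 2), "weeks")
```

Under the exponential assumption the implied half-life is about 2.6
weeks, so a titer of 1:80 would be expected to reach 1:10 in under
8 weeks, which is shorter than the ten week interval reported for C8.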



Sequences of viruses. To evaluate the similarity
of the breakthrough viruses with the vaccine antigen,
nucleotide sequences for gp120 from all seven
breakthrough viruses were determined. Envelope
glycoprotein genes were amplified from proviral DNA
using the polymerase chain reaction. Sequences were
obtained by direct amplification of DNA from lysates of
gradient-purified lymphocytes obtained directly from
patient blood without any intermediate tissue culture
or amplification step.

A listing of the complete gp120 sequences (two
clones per specimen) is provided in Figure 3. All
seven envelope glycoproteins possessed sequences
typical of subtype (clade) B viruses. The overall
homology with MN-rgp120 ranged from 69-80% (Table 5).

                        TABLE 5
      Comparison of MN-rgp120 Sequence with Sequences
                from Infected Vaccinees*

        MN    C6.1  C8.3  C7.2  C11.5  C10.5  C17.1  C15.2
MN      100   79    78    70    75     69     80     72
C6.1          100   78    70    81     75     90     79
C8.3                100   68    80     76     84     83
C7.2                      100   80     73     76     73
C11.5                           100    75     70     80
C10.5                                  100    70     72
C17.1                                         100    87
C15.2                                                100

* - Data indicate percent identity.
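
A percent identity value of the kind tabulated in Table 5 can be
computed from two aligned sequences as follows. This Python sketch is an
assumption-laden illustration rather than the procedure used to build
the table: the alignment method, the gap treatment (columns where both
sequences have a gap are skipped, a gap against a residue counts as a
mismatch) and the function names are all hypothetical, and the short
inputs are the PND motifs quoted elsewhere in the text, not full gp120
sequences.

```python
def percent_identity(seq_a, seq_b, gap="-"):
    """Percent of compared alignment columns with identical residues."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a, seq_b):
        if a == gap and b == gap:
            continue            # column carries no information
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def identity_table(named_sequences):
    """Upper-triangular table of rounded percent identities."""
    names = list(named_sequences)
    return {
        (p, q): round(percent_identity(named_sequences[p], named_sequences[q]))
        for i, p in enumerate(names)
        for q in names[i:]
    }


if __name__ == "__main__":
    toy = {"MN": "IGPGRAF", "C7": "IGPGRAW", "C11": "LGPGSTF"}
    for pair, value in identity_table(toy).items():
        print(pair, value)
```

Only the upper triangle is produced, mirroring the layout of Table 5,
because percent identity computed this way is symmetric.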



Interestingly, a high percentage (four of seven) of the
breakthrough viruses differed from MN-rgp120 by 25-30%
[Myers et al., Retroviruses and AIDS Database, Los
Alamos National Laboratory (1992 and 1995)].
Historically, this degree of sequence variation is
typical of inter-subtype (inter-clade) variation rather
than intra-subtype variation, which is expected to be in
the 10-20% range [Myers et al., Retroviruses and AIDS
Database, Los Alamos National Laboratory (1992 and
1995)]. Of the viruses with the greatest homology to
MN-rgp120, two (C6 and C8) occurred as early
infections, prior to complete immunization, and one
(C17) occurred as a late infection.

Polymorphism in the V3 domain. Of particular
interest were polymorphisms in regions known to contain
epitopes recognized by virus neutralizing antibodies.
The best characterized neutralizing epitope, the
principal neutralizing determinant (PND), occurs at the
tip of the V3 loop. In subtype B viruses,
approximately 60% possess the MN serotype-defining
signature sequence, IGPGRAF (SEQ. ID. NO. 39), based on
identity with the prototypic MN strain of HIV-1
[Berman et al., J. Virol. 7:4464-9 (1992);
Myers et al., Retroviruses and AIDS Database, Los
Alamos National Laboratory (1992 and 1995);
La Rosa et al., Science 249:932-5 (1990)].

Three of the viruses (C6, C8, and C17) possessed
the MN serotype signature sequence (Figure 3). In
contrast, four viruses possessed sequences with radical
amino acid substitutions in the PND [IGPGRAW (C7),
LGPGSTF (C11), IGPGRVL (C10), and IGPGSAF (C15)]
(SEQ. ID. NOs. 40-43, respectively), and therefore were
classified as "non-MN-like" viruses. Of note, each of
the four "non-MN-like" sequences was rare (Table 6)
and was not typical of the most common "non-MN"
variants of subtype B viruses [Myers et al.,
Retroviruses and AIDS Database, Los Alamos National
Laboratory (1992 and 1995)].

                        TABLE 6
      Frequency of Polymorphisms at the Principal
      Neutralizing Determinant in HIV-1 Infected
        Individuals Immunized with MN-rgp120*

V3          Observed           Dataset Frequency
Sequence    n    Frequency   GNE      LANL      LANL.1    La Rosa
                             (n=52)   (n=519)   (n=160)   (n=245)
GPGRAF      3    0.42        0.67     0.57      0.66      0.60
GPGRAW      1    0.14        0.03     0.013     0.06      0.010
GPGRVL      1    0.14        <0.02    0.004     <0.006    <0.008
GPGSTF**    1    0.14        <0.02    <0.002    <0.006    <0.004
GPGSAF      1    0.14        0.02     0.011     <0.006    <0.004

* - Data set GNE refers to a collection of
52 independent isolates collected in 1992;
dataset LANL refers to a collection of
519 sequences reported by Myers et al.,
Retroviruses and AIDS Database, Los Alamos
National Laboratory 1992 and 1995; LANL.1 refers
to a collection of 160 epidemiologically unlinked
individuals provided by B. Korber (personal
communication); dataset La Rosa refers to sequence
data reported by La Rosa et al., Science 249:932-5
(1990).
** - Sequences were not present in the data sets
examined.

The prevalence of viruses with PND sequences matching the
breakthrough viruses ranged from a high of 1.3% (C7) to a low of
0.2% (C11) in a listing of 519 subtype B sequences compiled by the
Los Alamos National Laboratory [Myers et al.; Retroviruses and AIDS
Database, Los Alamos National Laboratory (1992 and 1995)].
Similarly low frequencies were observed in three other
independently derived data sets (Table 6). The occurrence of these
sequences did not differ significantly between data sets collected
prior to 1... [La Rosa et al.; Science 249:932-5 (1990)] and data
collected in 1992 or from a set of 160 epidemiologically unlinked
individuals (B. Korber, personal communication). All four sets of
data agreed that the prevalence of viruses with MN-like PND
sequences was in the range of 60%. Based on these data, four of the
seven breakthrough infections were determined to be caused by
viruses that fell outside of the spectrum of viruses that the
vaccine was expected to prevent.
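The observed frequencies reported for the breakthrough viruses are simple proportions: the number of viruses carrying a given V3 hexapeptide divided by the seven viruses analyzed. The following is a minimal Python sketch of that arithmetic, using the per-sequence counts (n) given in Table 6. The function name `motif_frequencies` is illustrative and is not part of the original work.

```python
from collections import Counter


def motif_frequencies(motifs):
    """Map each motif to the fraction of isolates that carry it."""
    counts = Counter(motifs)
    total = len(motifs)
    return {motif: count / total for motif, count in counts.items()}


# V3 hexapeptides of the seven breakthrough viruses, taken from the
# counts (n) in Table 6: three GPGRAF and one each of the other four.
breakthrough = ["GPGRAF"] * 3 + ["GPGRAW", "GPGRVL", "GPGSTF", "GPGSAF"]

observed = motif_frequencies(breakthrough)
for motif, freq in observed.items():
    print(f"{motif}: {freq:.2f}")
```

Note that 3/7 rounds to 0.43, whereas Table 6 lists 0.42 for GPGRAF; the single-occurrence sequences give 1/7, or 0.14, as tabulated.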

Other features of breakthrough virus V3 domains.
Like MN-rgp120, the V3 domains of all of the breakthrough viruses
were 36 amino acids in length. However, all seven viruses differed
from MN-rgp120 with respect to the number of glycosylation sites
and with respect to the syncytium-inducing (SI) signature sequence.

The sequence of MN-rgp120 is somewhat unusual [Myers et al.;
Retroviruses and AIDS Database, Los Alamos National Laboratory
(1992 and 1995)] in that it lacks an N-linked glycosylation site at
position 306 in the V3 domain. The lack of this glycosylation site
does not appear to be antigenically significant since antisera to
MN-rgp120 are known to neutralize a variety of viruses (e.g. SF-2,
DU6587-5, DU4489-5, CC) that possess a glycosylation site at this
position [Berman et al.; J. Virol. 7:4464-9 (1992)].

In addition, the V3 domain of MN-rgp120 possessed sequence
polymorphisms (R at position 311, K at position 324, K at
position 328) typical of syncytium-inducing viruses [Fouchier et
al.; J. Virol. 66:3153-87 (1992)], whereas all seven breakthrough
viruses possessed sequences associated with non-syncytium-inducing
viruses. Syncytium-inducing viruses have been associated with
rapid ... [... et al.; J. Virol. 62:2026-3...] and T cell tropism
[O'Brien et al.; Nature (London) 348:69-73 (1990); Shioda et al.;
Nature (London) 349:167-9 (1991)]. To date, viruses with these
properties have not been recovered from any of the MN-rgp120
immunized volunteers.



Polymorphism in the V1, V2 and C4 domains.
Previous investigations have identified additional neutralizing
epitopes in the V1, V2 and C4 domains of gp120 [Nakamura et al.;
J. Virol. 67:6179-91 (1993); McKeating et al.; AIDS Research and
Human Retroviruses 8:451-9 (1992); Ho et al.; J. Virol. 65:489-93
(1991); Barbas et al.; Proc. Natl. Acad. Sci. USA 91:3809-13
(1994); McKeating et al.; J. Virol. 67:4932-44 (1993); Moore et
al.; J. Virol. 67:6136-6151 (1993); Davis et al.; J. Gen. Virol.
74:2609-17 (1993)].

The best characterized of these neutralizing epitopes is in the
C4 domain, which has attracted special attention because antibodies
binding to this area are known to block the binding of gp120 to CD4
[Moore et al.; AIDS 3:155-63 (1989); McKeating et al.; AIDS
Research and Human Retroviruses 8:451-9 (1992)]. Because the
epitope is located in a conserved (C) domain, naturally-occurring
polymorphism in this region is far more limited than in other
neutralizing epitopes. Nakamura et al., J. Virol. 67:6179-91
(1993), reported that the binding of a number of neutralizing MAbs
was dependent on K at position 429.

Comparison of the sequence of MN-rgp120 with other strains of
HIV-1 showed that a common polymorphism, involving the substitution
of E for K, occurs at this position. Indeed, substrains of the same
virus isolate often show polymorphism at this position. The HXB2
substrain of HIV-1LAI contains K at position 429, whereas the
BH10, IIIB, and LAV substrains of HIV-1LAI contain E at this
position [Nakamura et al.; J. Virol. 67:6179-91 (1993)].
Similarly, the 1984 isolate of HIV-1MN exhibited E at this
position, while the 1990 isolate of HIV-1MN, used to produce
MN-rgp120, possessed K at this position.

When the sequences of the infected vaccine recipients were
examined (Figure 3), the virus from subject C17, like MN-rgp120,
contained K at position 429, whereas the six other viruses differed
from the vaccine immunogen and possessed E at this position. These
results demonstrated that six of the seven breakthrough viruses
differed from the vaccine immunogen at the CD4-blocking,
neutralizing epitope in the C4 domain of gp120.

Studies with monoclonal antibodies have defined neutralizing
epitopes in the V1 and V2 domains of gp120 [McKeating et al.;
J. Virol. 67:4932-44 (1993); Moore et al.; J. Virol. 67:6136-6151
(1993); Davis et al.; J. Gen. Virol. 74:2609-17 (1993)]. Like the
polymorphisms that occur in the C4 domain, the V2 domains exhibit
several common polymorphisms that affect the binding of virus
neutralizing MAbs. One such polymorphism occurs at position 171,
which is critically important for the binding of MAb 1025, whereas
residue 187 is important for the binding of several MAbs
represented by 1088. When the V2 domain sequences were examined
(Figure 3), all of the viruses from the infected vaccinees differed
from MN-rgp120 in that R replaced G at position 171 and I or V
replaced E at position 187. Antibodies recognizing these adjacent
sites in the V2 domain of MN-rgp120 would not be expected to
neutralize viruses with radical amino acid substitutions at these
positions. Thus, all seven breakthrough viruses differed from
MN-rgp120 at a neutralizing epitope in the V2 domain of gp120.

Other neutralizing epitopes have been reported in the V1 domain
of gp120 [O'Brien et al.; Nature (London) 348:69-73 (1990);
McKeating et al.; J. Virol. 67:4932-44 (1993)]. Although the
neutralizing epitopes in the V1 domain of MN-rgp120 have not been
characterized, the polymorphism seen among the breakthrough viruses
in this region was interesting. Particularly striking (Figure 3)
was that the length of this domain ranged from 20 amino acids (C17)
to 45 amino acids (C6), and the number of N-linked glycosylation
sites ranged from 2 to 6. In contrast, the V1 domain of MN-rgp120
is 31 amino acids in length and encodes three N-linked
glycosylation sites.

Although examination of sequence databases suggests that
variation in the V2 region is comparable to the V1 region, the V2
region of the breakthrough viruses showed less variation than
expected. Specifically, the length of the V2 region ranged from
36 amino acids (C7) to 39 amino acids, with six of seven viruses
containing three N-linked glycosylation sites in this domain. A
high degree of polymorphism was found in the V4 region, where
sequences ranged from 26 (C10) to 33 (C15, C7) amino acids in
length and contained either 4 or 5 N-linked glycosylation sites.
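Counts of potential N-linked glycosylation sites such as those above are conventionally obtained by scanning a protein sequence for the sequon N-X-S/T, where X is any residue except proline. The following Python sketch shows one way to do this. It is an illustration under that standard assumption, not the procedure used in this work, and the peptide is a hypothetical toy sequence rather than one of the sequences described here.

```python
import re

# N-X-S/T sequon: asparagine, any residue except proline, then serine
# or threonine. The lookahead lets overlapping sequons each be counted.
SEQUON = re.compile(r"N(?=[^P][ST])")


def sequon_positions(protein):
    """Return 1-based positions of the N in each N-X-S/T sequon."""
    return [m.start() + 1 for m in SEQUON.finditer(protein.upper())]


# Hypothetical toy peptide, not one of the sequences in this document.
toy = "NCSFNITTSNPSANGT"
print(sequon_positions(toy))       # positions of predicted sites
print(len(sequon_positions(toy)))  # number of predicted sites
```

A sequon only marks a potential site; whether a given asparagine is actually glycosylated cannot be determined from sequence alone.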

Antigenicity of envelope glycoproteins from breakthrough viruses.
To determine the significance of sequence variation on
glycoprotein antigenicity, recombinant gp120 was prepared from the
viruses of all seven infected vaccinees (Figure 4). In these
studies a series of MAbs was assembled and their binding to
MN-rgp120 was compared to that of rgp120 from the vaccinee isolates
by ELISA (Table 7).
                              TABLE 7
     Relative Reactivity* of MAb Binding to rgp120 from Infected
            Subjects Compared with Binding to MN-rgp120

            V3      V3      Discontinuous   V2      C4      V2
rgp120      1034    50.1    15e             1025    1024    1088

MN          1.0     1.00    1.00            1.00    1.00    1.00
C6.1        0.37    0.37    0.17            0.00    0.00    0.00
C6.5        0.33    0.33    0.75            0.00    0.00    0.00
C8.3        0.11    0.37    0.38            0.00    0.00    0.00
C8.6        0.14    0.34    0.29            0.00    0.00    0.00
C7          0.47    0.60    0.71            0.00    0.00    0.00
C11.5       0.00    0.00    0.17            0.00    0.00    0.00
C11.7       0.00    0.00    0.17            0.00    0.00    0.00
C10.5       0.33    0.40    0.46            0.24    0.03    0.04
C10.7       0.42    0.48    0.50            0.29    0.07    0.09
C17.1       0.33    0.52    0.33            0.00    0.30    0.07
C17.3       0.37    0.56    0.33            0.00    0.38    0.06
C15.2       0.00    0.47    0.92            0.00    0.00    0.00
C15.3       0.00    0.37    0.63            0.00    0.00    0.00

* - Relative reactivity values represent the ratio of optical
    densities obtained with rgp120 from patient isolates divided by
    the optical density obtained for MN-rgp120 at a MAb
    concentration of 2 micrograms per milliliter.
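The relative reactivity defined in the footnote to Table 7 is a simple ratio: the optical density measured for an isolate's rgp120 divided by the optical density measured for MN-rgp120 at the same MAb concentration. A minimal Python sketch of that calculation follows. The function name, clone labels and optical density values are hypothetical and are not data from this work.

```python
def relative_reactivity(od_isolate, od_reference):
    """Ratio of the isolate's optical density to the reference's."""
    if od_reference <= 0:
        raise ValueError("reference optical density must be positive")
    return od_isolate / od_reference


# Hypothetical optical densities read at one MAb concentration;
# these numbers are illustrative and are not data from Table 7.
od_mn = 1.25
od_by_clone = {"clone_A": 0.50, "clone_B": 0.00}

ratios = {
    clone: round(relative_reactivity(od, od_mn), 2)
    for clone, od in od_by_clone.items()
}
print(ratios)
```

By construction the reference protein scores 1.00 against itself, which is why every MN entry in such a table equals one.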







In control experiments, the binding of MAb 5B6 (which is
specific for the HSV-1 gD-1 flag epitope fused to the N terminus of
all of the rgp120 proteins) was used to standardize the amount of
gp120 from each isolate (Figure 5A). These studies demonstrated
that the assay was carried out under conditions where equivalent
amounts of rgp120s were captured onto wells of microtiter plates.
The antigenic structure of the V3 domain was examined using the
1034 MAb (isolated from mice immunized with MN-rgp120 as described
in Nakamura et al.; J. Virol. 67:6179-91 (1993)) and the 50.1 MAb
(prepared from mice immunized with a synthetic V3 domain peptide as
described in Rini et al.; Proc. Natl. Acad. Sci. USA 90:6325-9
(1993)). Both MAbs are known to exhibit potent virus neutralizing
activity. When binding to the recombinant proteins was examined,
the MAb binding to MN-rgp120 was at least 10-fold greater than to
any of the breakthrough virus envelope proteins (Figure 5B and C).
Surprisingly, rgp120 from the three patient isolates (C8, C6, and
C17) that possessed the MN serotype-defining sequence, IGPGRAF
(SEQ. ID. NO. 39), varied from one another in their MAb binding
activity. Thus, the binding of MAb 1034 and MAb 50.1 to rgp120
from C17 was significantly greater than the binding to rgp120s from
C6 and C8.

A distinction in the epitopes recognized by these MAbs was
evident since C6-rgp120 and C8-rgp120 gave comparable binding with
50.1, whereas 1034 bound better to the C6-derived protein than the
C8-derived protein. The poorest MAb reactivity was with rgp120s
from C11 and C15. This result was consistent with sequence
analysis demonstrating that these two viruses both possessed the
radical substitution of S for R at position 18 in the V3 domain.
Surprisingly, both of these MAbs exhibited better than expected
binding to rgp120 from the C7 and C10 viruses. Like MN-rgp120,
both proteins contained the penta-peptide IGPGR sequence
(SEQ. ID. NO. 44) in the V3 loop, but differed from MN-rgp120 in
that V and L replaced A and F at positions 319 and 320 in gp120
from C10, and W replaced F at position 320 in gp120 from C7. These
results indicate that R at position 318 is essential for the
binding of these two MAbs, and that the epitopes recognized by 1034
and 50.1 are not completely destroyed by the hydrophobic
substitutions at positions 319 and 320.

As predicted from the sequence data, there was little if any
binding to the breakthrough virus rgp120s using MAbs (1088 and
1025) directed to the V2 region of MN-rgp120. Also consistent with
sequence data was the finding that MAb 1024, directed to the C4
domain of MN-rgp120, gave some reactivity with rgp120 from C17,
which, like MN-rgp120, contained K at position 429, but gave no
reactivity with the other isolates that contained E at residue 429.

Together, these studies demonstrated that the antigenic
structure of all seven breakthrough viruses differed from the
vaccine immunogen at three well characterized neutralizing
epitopes.

A totally different pattern of reactivity was observed with the
human hybridoma MAb 15e, prepared from an HIV-1 infected individual
as described in Ho et al.; J. Virol. 65:489-93 (1991). With this
MAb, the greatest binding was achieved with MN-rgp120 and rgp120
from C7, and the poorest reactivity was seen with the two clones of
rgp120 from C11. Moderate, but comparable, reactivity was seen
with rgp120s from C10 and C17.

These results demonstrate that the 15e epitope is polymorphic,
and that the epitope is conserved on MN-rgp120 and rgp120 from C7,
but has been lost on rgp120s from C11. Interestingly, the two
different clones of gp120 derived from C6 gave strikingly different
patterns of antibody binding. Thus, rgp120 from clone C6.5
exhibited strong reactivity with this antibody, whereas rgp120 from
clone C6.1 exhibited significantly weaker activity with this MAb.
Comparison of sequence data (Figure 3) showed that the two C6
clones differed at 6 amino acid positions.

Based on comparative binding to the other viral proteins of
known sequence, it appeared that the substitution of K for I at
position 351 in the C3 domain of gp120 could account for the
difference in binding activity. This result is also consistent
with both clones of C11 similarly containing a positively-charged K
at this position, and also being poorly reactive with this MAb.
Alternatively, a T for I substitution at position 439 in the C4
domain could account for the difference in 15e binding between C6.1
and C6.5. Although the inability of the two C11 clones to bind 15e
cannot be explained by polymorphism at this position in the C4
domain, they could be affected by the adjacent T for M substitution
at position 434.

Discussion

In these studies, the viruses and immune responses in seven of
nine vaccinees who became infected with HIV-1 through high risk
activity while participating in Phase 1 or Phase 2 trials of
MN-rgp120, a candidate HIV-1 vaccine, were analyzed. Such
infections would be expected to occur for one of two reasons:
1) lack of sufficient immune response at the time of infection; or
2) infection with viruses that fall outside of the antigenic
spectrum expected to be covered by the vaccine immunogen. The data
indicate that both explanations may be involved with the infections
observed (Table 8).
                              TABLE 8
                 Summary of Breakthrough Infections

                                        Homologous to MN-rgp120:
                         MN-rgp120
Case No.   Adequate      Homology       V3            C4
           Immunization  (%)            PND Epitope   Epitope

C6 • 1 ' ■-: ;"'-' V 79 . - •" : - y ^-: : yO '..':' - " - 

C8 - 78 - - , 

; • ci5 - ' 72 ■ - 

C7 - 7 0 ; ' - 

10 : cil ; 7 5. ' ; .*y - . 

CIO : 69 • v; 

C17 ' .+ ■..' 80 + .•„'■■■ ■'".''> . 



Two of the infections occurred in individuals who failed to
receive the minimum three doses of vaccine typically required for
the induction of protective immunity with protein subunit vaccines
(e.g. hepatitis B virus formulated in alum adjuvant, described in
Francis et al.; Ann. Int. Med. 97:362-6 (1982)). Two additional
breakthrough infections occurred in vaccinees who had weak or
undetectable primary (C7) and booster (C15) responses. Of the
three individuals who became infected with HIV-1 after receiving
three or more productive immunizations (C10, C11, and C17), at
least two, and possibly all three, appear to have become infected
more than six months after receiving their last immunization.
Because antibody titers to MN-rgp120 typically decay with a
half-time of 2 to 2.5 months [Belshe et al.; JAMA 272(6):475-80
(1994); Berman et al.; AIDS 8:591-601 (1994)], antibody titers
would be expected to have decayed at least eight-fold and possibly
as much as sixty four-fold at the time of infection. Thus, the
lack of a sufficient immune response at the time of infection
represents a potential explanation for at least six of the seven
breakthrough infections.
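The fold reductions quoted above follow from the half-time of antibody decay. Assuming simple exponential decay (an assumption made here for illustration), the titer falls by a factor of 2 raised to the number of half-lives elapsed. The sketch below works through that arithmetic in Python. The function name is illustrative, and the 12-month interval is chosen only to show how a sixty four-fold figure can arise.

```python
def fold_decay(months_elapsed, half_life_months):
    """Fold reduction in titer after simple exponential decay."""
    if half_life_months <= 0:
        raise ValueError("half-life must be positive")
    return 2 ** (months_elapsed / half_life_months)


# Three half-lives (6 months at a 2-month half-time) give 8-fold decay;
# six half-lives (12 months, an illustrative interval) give 64-fold.
print(fold_decay(6, 2))    # 8.0
print(fold_decay(12, 2))   # 64.0

# With the slower 2.5-month half-time, 6 months is 2.4 half-lives.
print(round(fold_decay(6, 2.5), 1))
```

Under this model the eight-fold figure corresponds to exactly six months at the 2-month half-time; longer intervals since the last immunization increase the reduction.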

Data from vaccine efficacy studies in gp160 immunized
chimpanzees [... et al.; Longitudinal Vaccine-Induced Immunity and
Risk Behavior of Study Participants in AVEG Phase II Protocol 201,
In: Abstracts from Eighth Annual Meeting of the National
Cooperative Vaccine Development Groups for AIDS. Bethesda, MD
1996:216] challenged with HIV-1 and gp120-immunized rhesus
macaques challenged with a chimeric SIV/HIV-1 virus (SHIV) suggest
that the magnitude of the neutralizing antibody response at the
time of infection is a critical correlate of protective immunity.
If maintaining neutralizing antibody titers proves to be a valid
correlate of protective immunity in humans, then formulations
(e.g. novel adjuvants) or immunization regimes (frequent boosting)
designed to maximize the antibody responses may be required to
achieve long lasting protection. Use of a booster every six ...

The other likely explanation for the late infections is the
antigenic difference between the vaccine and the breakthrough virus
envelope glycoproteins. This explanation is supported by the
observation that four of the seven breakthrough viruses possessed
envelope glycoproteins that differed from the MN-rgp120 by 25-30%
at the amino acid level. Differences of this magnitude have
historically [Myers et al.; Retroviruses and AIDS Database, Los
Alamos National Laboratory (1992 and 1995)] been associated with
inter-subtype variation and far exceed the average 10-20%
variation expected for viruses within the same subtype.
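A percent difference at the amino acid level can be computed by comparing two aligned sequences position by position. The Python sketch below illustrates the calculation. It assumes the sequences have already been aligned to equal length and treats a gap character like any other mismatch; the alignment method actually used for the figures above is not stated here, so this is an illustration rather than a reproduction. The V3 hexapeptides GPGRAF and GPGSTF serve only as a short example.

```python
def percent_difference(seq_a, seq_b):
    """Percent of aligned positions at which two sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to equal length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    mismatches = sum(a != b for a, b in zip(seq_a, seq_b))
    return 100.0 * mismatches / len(seq_a)


# Two V3 hexapeptides serve as a short, pre-aligned example.
print(round(percent_difference("GPGRAF", "GPGSTF"), 1))
```

Percent homology is the complement of this value, so a protein that differs from the reference at 25% of positions is 75% homologous to it.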

Although the biologic significance of sequence variation in
many regions of the envelope glycoprotein is unclear, polymorphism
is an important factor that affects vaccine efficacy. Previous
studies [Salmon-Ceron et al.; AIDS Res. and Human Retroviruses ...;
... et al.; Science 250:1590-3 (1990)] have demonstrated that the
breadth of neutralizing activity elicited by HIV-1 envelope derived
vaccines was critically dependent on the sequence of epitopes in
the V3 domain (e.g. the PND). Thus, candidate vaccines based on
the LAI strain of HIV-1 (the prototypic "non-MN-like" subtype B
virus) exhibited little or no cross neutralizing activity with
subtype B viruses, whereas vaccines that contained the "MN-like"
PND sequence (IGPGRAF) (SEQ. ID. NO. 44) exhibited broad cross
neutralizing activity. That four of the seven breakthrough viruses
possessed envelope glycoproteins with radical amino acid
substitutions in the PND is consistent with the explanation that
differences in antigenic structure explain some of these
infections.
Over the last few years, it has become clear that polymorphism
among "MN-like" viruses occurs at neutralizing epitopes outside of
the PND. The best example occurs in the C4 domain, where two
antigenically distinct variants are distinguished by the presence
of either K or E at position 429 [Moore et al.; AIDS 3:155-63
(1989)]. Because six of the seven breakthrough viruses differed
from the vaccine strain in that they contained E rather than K at
position 429, antibodies raised to the C4 domain of MN-rgp120 were
unlikely to neutralize the viruses infecting six of the seven
vaccinees.

Other neutralizing epitopes are known to be present in the V1
and V2 domains of gp120. Although these regions are highly
variable due to insertions and deletions, neutralizing epitopes
have been described by McKeating et al.; J. Virol. 67:4932-44
(1993); Moore et al.; J. Virol. 67:6136-6151 (1993); and Davis et
al.; J. Gen. Virol. 74:2609-17 (1993). Several of these epitopes
overlap an amino terminal sequence of the V2 domain containing the
tri-peptide sequence RDK at positions corresponding to 142 to 144
of MN-rgp120 [McKeating et al.; J. Virol. 67:4932-44 (1993); Moore
et al.; J. Virol. 67:6136-6151 (1993)]. Like the C4 epitope,
variation in this sequence is known to occur between different
substrains derived from the same parental isolate. Since all seven
breakthrough viruses differed from MN-rgp120 in that they possessed
the RDK sequence, rather than the GDK sequence present in the
vaccine antigen, neutralizing antibodies to the V2 domain of
MN-rgp120 would not have been expected to neutralize any of the
viruses recovered from the vaccinees immunized with MN-rgp120.

Although polymorphisms at neutralizing epitopes might account
for the lack of protection in most of the infections, this does not
appear to explain the infection of vaccinee C17, who was infected
by a virus that matched MN-rgp120 in the V3 and C4 domains. If a
difference in sequence was responsible for the lack of protection
in this case, the critical difference might relate to the unusual
sequence in the V1 domain of gp120 from this breakthrough virus.
Several studies have shown that the V1 domain possesses epitopes
recognized by virus neutralizing monoclonal antibodies [McKeating
et al.; J. Virol. 67:4932-44 (1993); Davis et al.; J. Gen. Virol.
74:2609-17 (1993); Kayman et al.; J. Virol. 68:400-410 (1994)].

Although far less is known about the V1 epitopes relative to
other neutralizing sites, the V1 epitopes appear to be
conformation-dependent, and antisera from HIV-1 infected
individuals recognize epitopes in the V1 and V2 domains [McKeating
et al.; J. Virol. 67:4932-44 (1993); Kayman et al.; J. Virol.
68:400-410 (1994)]. The V1 sequence of the C17 virus is noteworthy
because it is smaller and contains fewer glycosylation sites than
that of MN-rgp120 or any of the other breakthrough viruses. By the
same token, the envelope glycoproteins from C11 and C6 are
noteworthy because they are significantly larger and contain more
glycosylation sites than MN-rgp120 or the other breakthrough
viruses.

While differences in amino acid sequence can provide clues to
differences in antigenic structure, the consequences of such
polymorphism can only be proven through antibody binding studies.
To correlate differences in sequence with differences in antigenic
structure, gp120 from two clones each of all seven breakthrough
viruses was expressed and the antigenicity of the clones with a
panel of monoclonal antibodies was examined. As predicted from the
sequence data, none of the breakthrough virus envelope
glycoproteins reacted with neutralizing MAbs to the V2 domain of
MN-rgp120. When MAbs to the C4 domain were examined, only the C17
envelope glycoprotein (that matched MN-rgp120 with respect to K429)
showed significant, albeit slower, binding. Surprisingly, the
three breakthrough envelope glycoproteins that contained the ...
consensus sequence, IGPGRAF (SEQ. ID. NO. ...), gave poor
reactivity with all three PND directed MAbs, even though they
possessed PND sequences closely related to the vaccine immunogen.
Thus, all three of the vaccinee isolates appeared to possess
changes outside of the recognition site that interfered with MAb
binding.

It has been known for many years that resistance to
neutralization in vitro can sometimes be attributed to mutations in
remote sequences that alter the conformation of neutralizing
epitopes and interfere with recognition by virus neutralizing
antibodies [Nara et al.; J. Virol. 64:3779-91 (1990); Cordonnier
et al.; Nature 340:571-4 (1989)]. Together these results indicate
that the antigenic structure of the envelope glycoproteins
recovered from the breakthrough viruses differed significantly from
that of the vaccine antigen.

A novel result was the localization of residues in the C3
domain that appeared to affect the binding of the virus
neutralizing human MAb, 15e. This MAb is known to recognize a
discontinuous epitope, block CD4 binding, and neutralize a variety
of laboratory and primary isolates of HIV-1 [Ho et al.; J. Virol.
65:489-93 (1991); Thali et al.; J. Virol. 66:5635-5641 (1992);
Moore et al.; AIDS Res. Hum. Retroviruses 9:1179-1187 (1993)].
Comparative binding to envelope glycoproteins from the
breakthrough viruses indicated that recognition by this antibody is
critically dependent on residues in the C3 or C4 domains of gp120.
The unique occurrence of a positively charged K at position 351 in
the C3 domain provides a common explanation for the inability of
the C11.5, C11.7 and C6.1 strains of HIV-1 to bind 15e.
Alternatively, it is possible that different amino acid
substitutions in different locations account for the failure of 15e
to bind to the C6.1 and C11 clones. The only obvious positions
where substitutions of this type occur are in the C4 domain, where
T replaces M at 434 (C11) and T replaces I at 439.

The present studies demonstrate that the current MN-rgp120
vaccine is less than 100% effective against HIV-1 infection. Based
on previous in vitro and in vivo studies with MN-rgp120, protection
from natural HIV-1 infection in humans is expected to depend on a
threshold concentration of virus-neutralizing antibodies, and
antigenic similarity between the vaccine immunogen and the
challenge virus.
In this regard, only one of the seven breakthrough infections
(C17) was unexpected. This individual received a full course of
immunizations yet became infected with a virus similar to MN-rgp120
at at least two important neutralizing epitopes (V3 and C4
domains). This infection might be related to the magnitude of the
antibody response at the time of infection, or antigenic
differences between the breakthrough virus and the vaccine strain,
or circumstances of infection (e.g., ulcerative lesions, infection
by donor with acute infection or high viremia) not monitored in
this protocol. Alternatively, this individual may represent a true
vaccine failure, without clear explanation.
On balance, the analysis of breakthrough infections described
herein did not uncover any data that would discourage the continued
development of MN-rgp120 as a vaccine to prevent HIV-1 infection.
The results support speculation that enhancing vaccine
immunogenicity (as by additional booster immunizations) may be
required to maintain long term protective immunity, and that the
addition of rgp120 from other antigenically different strains of
virus in addition to MN-rgp120 is useful to expand the breadth of
protection.

The availability of viruses and viral glycoproteins derived
from breakthrough infections may provide an important means to
streamline the process of identifying new antigens for inclusion
into a multivalent vaccine. Recombinant viral glycoproteins
prepared from breakthrough viruses, by definition, possess
antigenic structures that are significantly different from
MN-rgp120, and are representative of viruses currently being
transmitted. Thus, combining rgp120 from breakthrough viruses with
MN-rgp120 is an effective way to complement and significantly
expand antigenic complexity and increase breadth of cross
neutralizing activity.
SEQUENCE LISTING






(1) GENERAL INFORMATION:

    (i) APPLICANT: Berman, Phillip W.
    (ii) TITLE OF INVENTION: HIV ENVELOPE POLYPEPTIDES AND
         VACCINE
    (iii) NUMBER OF SEQUENCES: 44
    (iv) CORRESPONDENCE ADDRESS:
         (A) ADDRESSEE: SKJERVEN, MORRILL, MACPHERSON, ET AL.
         (B) STREET: 25 Metro Drive, Suite 700
         (C) CITY: San Jose
         (D) STATE: California
         (E) COUNTRY: USA
         (F) ZIP: 95110
    (v) COMPUTER READABLE FORM:
         (A) MEDIUM TYPE: 3.5 inch, 1.44 Mb floppy disk
         (B) COMPUTER: IBM PC compatible
         (C) OPERATING SYSTEM: PC-DOS/MS-DOS
         (D) SOFTWARE: WinPatin (Genentech)
    (vi) CURRENT APPLICATION DATA:
         (A) APPLICATION NUMBER:
         (B) FILING DATE:
         (C) CLASSIFICATION:
    (viii) ATTORNEY/AGENT INFORMATION:
         (A) NAME: Terlizzi, Laura
         (B) REGISTRATION NUMBER: 31,307
         (C) REFERENCE/DOCKET NUMBER: M-3897 US
    (ix) TELECOMMUNICATION INFORMATION:
         (A) TELEPHONE: (408) 453-9200
         (B) TELEFAX: (408) 453-7979

(2) INFORMATION FOR SEQ ID NO: 1:

    (i) SEQUENCE CHARACTERISTICS:
         (A) LENGTH: 1503 base pairs
         (B) TYPE: Nucleic Acid
         (C) STRANDEDNESS: Single
         (D) TOPOLOGY: Linear

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

GGG GTA CCT GTG TGG AAG GAA GCA ACC ACC ACT CTA        36
Gly Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu
1               5                   10

TTT TGT GCA TCA GAT GCT AAA GCA TAT GAC ACA GAG GTG    75
Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val
        15                  20                  25

CAT AAT GTT TGG GCC ACA CAT GCT TGT GTA CCC ACA GAC    114
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp
                30                  35

CCA AAC CCA CAA GAA ATG GTA TTG GAA AAT GTG ACA GAA    153
Pro Asn Pro Gln Glu Met Val Leu Glu Asn Val Thr Glu
    40                  45                  50

GAT TTT AAC ATG TGG AAA AAT GAC ATG GTA GAA CAG ATG    192
Asp Phe Asn Met Trp Lys Asn Asp Met Val Glu Gln Met
            55                  60

CAT GAG GAT ATA ATC AGT TTA TGG GAT CAA AGC CTA AAA    231
His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys
65                  70                  75
CCA TGT GTA AAA TTA ACC CCA CTC TGT ATT ACT TTA AAT    270
Pro Cys Val Lys Leu Thr Pro Leu Cys Ile Thr Leu Asn
        80                  85                  90

TGC ACC AAT TGG AAG AAG AAT GAT ACT AAA ACT AAT AGT    309
Cys Thr Asn Trp Lys Lys Asn Asp Thr Lys Thr Asn Ser
                95                  100

AGT AGT ACT ACA ACT AAT AAT AGT AGT GCT ACA GCT AAT    348
Ser Ser Thr Thr Thr Asn Asn Ser Ser Ala Thr Ala Asn
    105                 110                 115

AGT AGT AGT ACT ACA ACT AAT AGT AGT TGG GCA GAG ATA    387
Ser Ser Ser Thr Thr Thr Asn Ser Ser Trp Gly Glu Ile
            120                 125

AAG GAG GGA GAA ATA AAG AAC TGC TCT TTC AAT ATC ACC    426
Lys Glu Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr
130                 135                 140

ACA AGC ATA AGA GAC AAG GTG AAG AAA GAA TAT GCA CTT    465
Thr Ser Ile Arg Asp Lys Val Lys Lys Glu Tyr Ala Leu
        145                 150                 155

TTT TAT AGC CTT GAT GTA GTA CCA ATA GAA AAT GAT AAT    504
Phe Tyr Ser Leu Asp Val Val Pro Ile Glu Asn Asp Asn
                160                 165

ACT AGC TAT AGG TTG AGA AGT TGT AAC ACC TCA CTC ATT    543
Thr Ser Tyr Arg Leu Arg Ser Cys Asn Thr Ser Val Ile
    170                 175                 180

ACA CAA GCC TGT CCA AAG GTA ACT TTT GAG CCA ATT CCC    582
Thr Gln Ala Cys Pro Lys Val Thr Phe Glu Pro Ile Pro
            185                 190

ATA CAT TAT TGT ACC CCG GCT GGT TTT GCG ATT CTC AAG    621
Ile His Tyr Cys Thr Pro Ala Gly Phe Ala Ile Leu Lys
195                 200                 205

TGT AGA GAT AAA AAG TTC AAT GGA ACA GGA CCA TGC AAA    660
Cys Arg Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys
        210                 215                 220

AAT GTT AGC ACA GTA CAA TGT GCA CAT GGA ATT AAG CCA    699
Asn Val Ser Thr Val Gln Cys Ala His Gly Ile Lys Pro
                225                 230

GTA GTG TCA ACT CAA CTG CTG TTA AAT GGC AGC CTA GCA    738
Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala
    235                 240                 245

GAA GAA GAG GTA ATA ATT AGA TCT GCC AAT TTC TCA AAC    777
Glu Glu Glu Val Ile Ile Arg Ser Ala Asn Phe Ser Asn
            250                 255

AAT GCT AAA ATC ATA ATA GTA CAG TTG AGG GAA CCT GTA    816
Asn Ala Lys Ile Ile Ile Val Gln Leu Arg Glu Pro Val
260                 265                 270

GAA ATT AAT TGT ACA AGA CCC AGC AAC AAT ACA ATA AAA    855
Glu Ile Asn Cys Thr Arg Pro Ser Asn Asn Thr Ile Lys
        275                 280                 285
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GGT ATA CAC ATA GGA CGA GGG AGA GCA. TTT TAT GCA ACA 894 

. :: G : Iv tie His lie Gly Pro Gly Arg Ala; Phe Tyr Ala .Thr ■.. 

66k <ZP& ATA C<3A GGA GAT ATA AGA CAA GCA CAT. TGT AAC 933 • 
Gly Asp Ile Arg Gly Asp Ile Arg Gln Ala His Cys Asn
300 305 310

ATT ACT GGA GCA AAA TGG AAT AAC ACT TTA AAG AAG GTA 972
Ile Ser Gly Ala Lys Trp Asn Asn Thr Leu Lys Lys Val
315 320

GTT AAA AAA TTA AAA GAA CAA TTT CCA AAT AAA ACA ATA 1011
Val Lys Lys Leu Lys Glu Gln Phe Pro Asn Lys Thr Ile
325 330 335

GTC TTT AAC CAT TCC TCA GGA GGG GAC CCA GAA ATT GTA 1050
Val Phe Asn His Ser Ser Gly Gly Asp Pro Glu Ile Val
340 345 350

ATG CAC AGT TTT AAT TGT CAA GGG GAA TTT TTC TAC TGT 1089
Met His Ser Phe Asn Cys Gln Gly Glu Phe Phe Tyr Cys
355 360

AAT ACA ACA AAG CTG TTT AAT AGT ACT TGG AAT GAT ACT 1128
Asn Thr Thr Lys Leu Phe Asn Ser Thr Trp Asn Asp Thr
365 370 375

ACA GAG TCA AAT AAC AAT GAT AGT ACT ATT ACA CTC CCA 1167
Thr Glu Ser Asn Asn Asn Asp Ser Thr Ile Thr Leu Pro
380 385

TGC AGA ATA AAA CAA ATT ATA AAC ATG TGG CAG GAA ATA 1206
Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Ile
390 395 400

GGA AAA GCA ATG TAT GCC CCT CCC ACC AGA GGA GAA ATT 1245
Gly Lys Ala Met Tyr Ala Pro Pro Thr Arg Gly Glu Ile
405 410 415

AAA TGT TCA TCA AAT ATT ACA GGA CTA CTG TTA ATA AGA 1284
Lys Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Ile Arg
420 425

GAT GGT GGT ATT AAC ACT AGC GAT GCC ACC GAG ACC TTC 1323
Asp Gly Gly Ile Asn Thr Ser Asp Ala Thr Glu Thr Phe
430 435 440

AGA CCG GGA GGA GGA GAT ATG AGG GAC AAT TGG AGA AGT 1362
Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser
445 450

GAA TTA TAT AAA TAT AAA GTA GTG AAA ATT GAG CCA TTA 1401
Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu
455 460 465

GGA GTA GCA CCC ACC AAG GCA AAG AGA AGA GTG GTG CAG 1440
Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln
470 475 480

AGA GAA AAA AGA GCA GTA ACA CTA GGA GCT ATG TTC CTT 1479
Arg Glu Lys Arg Ala Val Thr Leu Gly Ala Met Phe Leu
485 490

GGG TTC TTA GGA GCA TAA AGC TTC 1503
Gly Phe Leu Gly Ala Xaa Ser Phe
495 500 501

(2) INFORMATION FOR SEQ ID NO: 2:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 501 amino acids

(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Gly Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala 

1 5 10 15

Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala

20 25 30






Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Met Val
35 40 45

Leu Glu Asn Val Thr Glu Asp Phe Asn Met Trp Lys Asn Asp Met
50 55 60

Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser
65 70 75

Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Ile Thr Leu Asn
80 85 90

Cys Thr Asn Trp Lys Lys Asn Asp Thr Lys Thr Asn Ser Ser Ser
95 100 105

Thr Thr Thr Asn Asn Ser Ser Ala Thr Ala Asn Ser Ser Ser Thr
110 115 120

Thr Thr Asn Ser Ser Trp Gly Glu Ile Lys Glu Gly Glu Ile Lys
125 130 135

Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp Lys Val Lys
140 145 150

Lys Glu Tyr Ala Leu Phe Tyr Ser Leu Asp Val Val Pro Ile Glu
155 160 165

Asn Asp Asn Thr Ser Tyr Arg Leu Arg Ser Cys Asn Thr Ser Val
170 175 180



Ile Thr Gln Ala Cys Pro Lys Val Thr Phe Glu Pro Ile Pro Ile
185 190 195

His Tyr Cys Thr Pro Ala Gly Phe Ala Ile Leu Lys Cys Arg Asp
200 205 210

Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val
215 220 225

Gln Cys Ala His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu
230 235 240

Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Ile Ile Arg Ser Ala
245 250 255

Asn Phe Ser Asn Asn Ala Lys Ile Ile Ile Val Gln Leu Arg Glu
260 265 270

Pro Val Glu Ile Asn Cys Thr Arg Pro Ser Asn Asn Thr Ile Lys
275 280 285

Gly Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr Gly Asp
290 295 300

Ile Arg Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Gly Ala
305 310 315

Lys Trp Asn Asn Thr Leu Lys Lys Val Val Lys Lys Leu Lys Glu
320 325 330

Gln Phe Pro Asn Lys Thr Ile Val Phe Asn His Ser Ser Gly Gly
335 340 345

Asp Pro Glu Ile Val Met His Ser Phe Asn Cys Gln Gly Glu Phe
350 355 360

Phe Tyr Cys Asn Thr Thr Lys Leu Phe Asn Ser Thr Trp Asn Asp
365 370 375

Thr Thr Glu Ser Asn Asn Asn Asp Ser Thr Ile Thr Leu Pro Cys
380 385 390

Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Ile Gly Lys Ala
395 400 405

Met Tyr Ala Pro Pro Thr Arg Gly Glu Ile Lys Cys Ser Ser Asn
410 415 420

Ile Thr Gly Leu Leu Leu Ile Arg Asp Gly Gly Ile Asn Thr Ser
425 430 435

Asp Ala Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp
440 445 450

Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu
455 460 465

Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln
470 475 480

Arg Glu Lys Arg Ala Val Thr Leu Gly Ala Met Phe Leu Gly Phe
485 490 495

Leu Gly Ala Xaa Ser Phe
500 501

(2) INFORMATION FOR SEQ ID NO: 3:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1503 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

GGG GTA CCT GTA TGG AAA GAA GCA ACC ACC ACT CTA 36
Gly Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu
1 5 10

TTT TGT GCA TCA GAT GCT AAA GCA TAT GAC ACA GAG GTG 75
Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val
15 20 25

CAT AAT GTT TGG GCC ACA CAT GCT TGT GTA CCC ACA GAC 114
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp
30 35

CCA AAC CCA CAA GAA ATG GTA TTG GAA AAT GTG ACA GAA 153
Pro Asn Pro Gln Glu Met Val Leu Glu Asn Val Thr Glu
40 45 50

GAT TTT AAC ATG TGG AAA AAT GAC ATG GTA GAA CAG ATG 192
Asp Phe Asn Met Trp Lys Asn Asp Met Val Glu Gln Met
55 60

CAT GAG ANT ATA ATC ACT TTA TGG GAT CAA AGC CTA AAA 231
His Glu Xaa Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys
65 70 75

CCA TGT GTA AAA TTA ACC CCA CTC TGT ATT ACT TTA AAT 270
Pro Cys Val Lys Leu Thr Pro Leu Cys Ile Thr Leu Asn
80 85 90

TGC ACC AAT TGG AAG GAG AAT GAT ACT AAA ACT AAT AGT 309
Cys Thr Asn Trp Lys Glu Asn Asp Thr Lys Thr Asn Ser
95 100

AGT AGT ACT ACA ACT AAT AAT AGT ACT GCT ACA GCT AAT 348
Ser Ser Thr Thr Thr Asn Asn Ser Ser Ala Thr Ala Asn
105 110 115






AGT AGT AGT ACT ACA ACT AAT AGT AGT TGG CGA GAC ATA 387
Ser Ser Ser Thr Thr Thr Asn Ser Ser Trp Gly Glu Ile
120 125

AAG GAG GGA GAA ATA AAG AAC TGC TCT TTC AAT ATC ACC 426
Lys Glu Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr
130 135 140

ACA GGC ATA AGA GAC AAG GTG AAG AAA GAA TAT CCA CTT 465
Thr Gly Ile Arg Asp Lys Val Lys Lys Glu Tyr Ala Leu
145 150 155

TTT TAT AGC CTT GAT GTA GTA CCA ATA GAA AAT GAT AAT 504
Phe Tyr Ser Leu Asp Val Val Pro Ile Glu Asn Asp Asn
160 165

ACT AGC TAT AGG TTG AGA AGT TGT AAC ACC TCA CTC ATT 543
Thr Ser Tyr Arg Leu Arg Ser Cys Asn Thr Ser Val Ile
170 175 180

ACA CAA GCC TGT CCA AAG GTA ACT TTT GAG CCA ATT CCC 582
Thr Gln Ala Cys Pro Lys Val Thr Phe Glu Pro Ile Pro
185 190

ATA CAT TAT TGT ACC CCG GCT GGT TTT GCG ATT CTG AAG 621
Ile His Tyr Cys Thr Pro Ala Gly Phe Ala Ile Leu Lys
195 200 205

TGT AAA GAT AAA AAG TTC AAT GGA ACA GGA CCA TGC AAA 660
Cys Lys Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys
210 215 220

AAT GTT AGC ACA GTA CAA TGT ACA CAT GGA ATT AAC CCA 699
Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro

225 230

GTA GTG TCA ACT CAA CTG CTG TTA AAT GGG ACC CTA GCA 738
Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala
235 240 245

GAA GAA GAG GTA ATA ATT AGA TCT GCC AAT TTC TCA AAC 777
Glu Glu Glu Val Ile Ile Arg Ser Ala Asn Phe Ser Asn
250 255

AAT GCT AAA ATC ATA ATA GTA CAG TTG AAG GAA CCT GTA 816
Asn Ala Lys Ile Ile Ile Val Gln Leu Lys Glu Pro Val
260 265 270

GAA ATT AAT TGT ACA AGA CCC AGC AAC AAT ACA ATA AAA 855
Glu Ile Asn Cys Thr Arg Pro Ser Asn Asn Thr Ile Lys
275 280 285

GGT ATA CAC ATA GGA CCA GGG AGA GCA TTT TAT GCA ACA 894
Gly Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr
290 295

GGA GAC ATA CGA GGA GAT ATA AGA CAA GCA CAT TGT AAC 933
Gly Asp Ile Arg Gly Asp Ile Arg Gln Ala His Cys Asn
300 305 310

ATT AGT GGA GCA AAA TGG AAT AAC ACT TTA AAG AAG GTA 972
Ile Ser Gly Ala Lys Trp Asn Asn Thr Leu Lys Lys Val
315 320

GTT ATA AAA TTA AAA GAA CAA TTT CCA AAT AAA ACA ATA 1011
Val Ile Lys Leu Lys Glu Gln Phe Pro Asn Lys Thr Ile
325 330 335

GTC TTT AAC CAT TCC TCA GGA GGG GAC CCA GAA ATT GTA 1050
Val Phe Asn His Ser Ser Gly Gly Asp Pro Glu Ile Val
340 345 350

ATG CAC AGT TTT AAT TGT CAA GGG GAA TTT TTC TAG TGT 1089
Met His Ser Phe Asn Cys Gln Gly Glu Phe Phe Tyr Cys
355 360

AAT ACA ACG AAG CTG TTT AAT AGT ACT TGG AAT GAT ACT 1128
Asn Thr Thr Lys Leu Phe Asn Ser Thr Trp Asn Asp Thr
365 370 375

ACA GAG TCA AAT AAC AAT GAT AGT ACT ATT ACA CTC CCA 1167
Thr Glu Ser Asn Asn Asn Asp Ser Thr Ile Thr Leu Pro
380 385

TGC AGA ATA AAA CAA ATT ATA AAC ATG TGG CAG GAA GTA 1206
Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val
390 395 400

GGA AAA GCA ATG TAT GCC CCT CCC ATC AGA GGA GAA ATT 1245
Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Glu Ile
405 410 415

AAA TGT TCA TCA AAT ATT ACA GGA CTA CTG TTA ACA AGA 1284
Lys Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg
420 425

GAT GGT GGT ATT AAC ACT AGC GAT GCC ACC GAG ACC TTC 1323
Asp Gly Gly Ile Asn Thr Ser Asp Ala Thr Glu Thr Phe
430 435 440

AGA CCG GGA GGA GGA GAT ATG AGG GAC AAT TGG AGA AGT 1362
Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser
445 450

GAA TTA TAT AAA TAT AAA GTA GTG AAA ATT GAG CCA TTA 1401
Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu
455 460 465

GGA GTA GGA CCC ACC AAG GCA AAG AGA AGA GTG GTG CAG 1440
Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln
470 475 480

AGA GAA AAA AGA GCA GTA ACA CTA GGA GCT ATG TTC CTT 1479
Arg Glu Lys Arg Ala Val Thr Leu Gly Ala Met Phe Leu
485 490

GGG TTC TTG GGA GCA TAA AGC TTC 1503
Gly Phe Leu Gly Ala Xaa Ser Phe

495 500 501 
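
The nucleic acid entries in this listing pair each codon with the three-letter residue printed beneath it, and the stated lengths are consistent with one residue per codon (1503 bases, 501 residues). The sketch below is not part of the original listing; it is a minimal illustration, assuming the standard genetic code, of how a line of SEQ ID NO: 3 can be checked against its printed translation. The names `translate`, `CODON_TO_ONE` and `THREE_LETTER` are hypothetical, and rendering stop codons and ambiguous codons (such as `ANT`) as `Xaa` is an assumption that mirrors how the listing prints them.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
# One-letter amino acid symbols; '*' marks a stop codon.
BASES = "TCAG"
ONE_LETTER = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_ONE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), ONE_LETTER)
}

# Three-letter residue names as printed in the listing.
# Assumption: stop codons are shown as Xaa, as the listing does for TAA.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Xaa",
}


def translate(nucleotides: str) -> list[str]:
    """Translate a whitespace-separated DNA string into three-letter residues."""
    seq = "".join(nucleotides.split()).upper()
    residues = []
    # Ignore any trailing partial codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        one = CODON_TO_ONE.get(seq[i:i + 3])  # None for ambiguous codons (e.g. ANT)
        residues.append(THREE_LETTER[one] if one else "Xaa")
    return residues


if __name__ == "__main__":
    first_line = "GGG GTA CCT GTA TGG AAA GAA GCA ACC ACC ACT CTA"
    last_line = "GGG TTC TTG GGA GCA TAA AGC TTC"
    print(" ".join(translate(first_line)))
    print(" ".join(translate(last_line)))
    print(1503 // 3)  # residue count implied by the stated nucleotide length
```

Running such a check line by line is one way to spot places where a transcribed codon and the residue printed beneath it disagree.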




(2) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 501 amino acids 

(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:



Gly Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala
1 5 10 15

Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala
20 25 30

Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Met Val
35 40 45

Leu Glu Asn Val Thr Glu Asp Phe Asn Met Trp Lys Asn Asp Met
50 55 60

Val Glu Gln Met His Glu Xaa Ile Ile Ser Leu Trp Asp Gln Ser
65 70 75

Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Ile Thr Leu Asn
80 85 90

Cys Thr Asn Trp Lys Glu Asn Asp Thr Lys Thr Asn Ser Ser Ser
95 100 105

Thr Thr Thr Asn Asn Ser Ser Ala Thr Ala Asn Ser Ser Ser Thr
110 115 120

Thr Thr Asn Ser Ser Trp Gly Glu Ile Lys Glu Gly Glu Ile Lys
125 130 135

Asn Cys Ser Phe Asn Ile Thr Thr Gly Ile Arg Asp Lys Val Lys
140 145 150

Lys Glu Tyr Ala Leu Phe Tyr Ser Leu Asp Val Val Pro Ile Glu
155 160 165

Asn Asp Asn Thr Ser Tyr Arg Leu Arg Ser Cys Asn Thr Ser Val
170 175 180

Ile Thr Gln Ala Cys Pro Lys Val Thr Phe Glu Pro Ile Pro Ile
185 190 195

His Tyr Cys Thr Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp
200 205 210

Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val
215 220 225

Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu
230 235 240

Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Ile Ile Arg Ser Ala
245 250 255

Asn Phe Ser Asn Asn Ala Lys Ile Ile Ile Val Gln Leu Lys Glu
260 265 270

Pro Val Glu Ile Asn Cys Thr Arg Pro Ser Asn Asn Thr Ile Lys
275 280 285

Gly Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala Thr Gly Asp
290 295 300

Ile Arg Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Gly Ala
305 310 315

Lys Trp Asn Asn Thr Leu Lys Lys Val Val Ile Lys Leu Lys Glu
320 325 330






Gln Phe Pro Asn Lys Thr Ile Val Phe Asn His Ser Ser Gly Gly
335 340 345

Asp Pro Glu Ile Val Met His Ser Phe Asn Cys Gln Gly Glu Phe
350 355 360

Phe Tyr Cys Asn Thr Thr Lys Leu Phe Asn Ser Thr Trp Asn Asp
365 370 375

Thr Thr Glu Ser Asn Asn Asn Asp Ser Thr Ile Thr Leu Pro Cys
380 385 390

Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala
395 400 405

Met Tyr Ala Pro Pro Ile Arg Gly Glu Ile Lys Cys Ser Ser Asn
410 415 420

Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Ile Asn Thr Ser
425 430 435

Asp Ala Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp
440 445 450

Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu
455 460 465

Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln
470 475 480

Arg Glu Lys Arg Ala Val Thr Leu Gly Ala Met Phe Leu Gly Phe
485 490 495

Leu Gly Ala Xaa Ser Phe
500 501



(2) INFORMATION FOR SEQ ID NO: 5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1461 base pairs
(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

G GTA GGT GTA TGG AAA GAA GCA ACC ACC ACT CTA TTT 37
Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe
1 5 10

TGT GCA TCA GAT GCT AAA GCA TAT GAT ACA GAG GTA CAT 76
Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val His
15 20 25

AAT GTT TGG GCT ACA CAT GCC TGT GTA CCC ACA GAC CCC 115
Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro
30 35

AAC CCA CAA GAA GTA GTA TTG GAA AAT GTA ACA GAA AAT 154
Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn
40 45 50

TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAG ATG CAT 193
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His
55 60

GAG GAT ATA ATC AGT TTA TGG GAT CAA ACT CTA AAG CCA 232
Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro
65 70 75

TGT GTA AAA TTA ACC CCA CTC TGT GTT ACT TTA AAT TGC 271
Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys
80 85 90

ACT AAT TTG GAG AAT GCT AAT AAT ACC GAG AAT GCT AAT 310
Thr Asn Leu Glu Asn Ala Asn Asn Thr Glu Asn Ala Asn
95 100

AAT ACC AAT AAT TAT ACC TTG GGG ATG GAG AGA GGT GAA 349
Asn Thr Asn Asn Tyr Thr Leu Gly Met Glu Arg Gly Glu
105 110 115

ATA AAA AAC TGC TCT TTC AAT ATC ACC ACA ACT TTA AGA 388
Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Leu Arg
120 125

GAT AAG GTG AAA AAA GAA TAT GGA TTG TTT TAT AAA CTT 427
Asp Lys Val Lys Lys Glu Tyr Ala Leu Phe Tyr Lys Leu
130 135 140

GAT GTA GTA CAA ATA GAT AAT AGT ACC AAC TAT AGG GTG 466
Asp Val Val Gln Ile Asp Asn Ser Thr Asn Tyr Arg Leu
145 150 155

ATA AGT TGT AAT ACC TCA GTC ATT ACA CAG GCC TGT CCA 505
Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro
160 165

AAG GTA TCC TTT GAG CTA ATT CCC ATA CAT TAT TGT GGC 544
Lys Val Ser Phe Glu Leu Ile Pro Ile His Tyr Cys Ala
170 175 180

CCG GCT GGT TTT GCG ATT CTA AAG TGT AAA GAT AAG AAG 583
Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys
185 190



TTC AAT GGA ACA GGA CCA TGT AAA AAT GTC AGC ACA GTA 622
Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val
195 200 205

CAA TGT ACA CAT GGA ATT AGA CCA GTA GTA TCA ACT CAA 661
Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln
210 215 220

CTA CTG TTA AAT GGC AGT CTA GCA GAA GAA GAG ATA GTA 700
Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Ile Val
225 230

ATT AGA TCT GAA AAT ATC ACA GAC AAT GCT AAA ACC ATA 739
Ile Arg Ser Glu Asn Ile Thr Asp Asn Ala Lys Thr Ile
235 240 245

ATA GTG CAG CTA AAT GAA TCT ATA GTG ATT AAT TGT ACA 778
Ile Val Gln Leu Asn Glu Ser Ile Val Ile Asn Cys Thr
250 255

AGA CCC AAT AAC AAC ACA AGA AAA AGT ATA AAT ATA GGA 817
Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Asn Ile Gly
260 265 270

CCA GGG AGA GCA TTC TAT ACA ACA GGA GAC ATA ATA GGA 856
Pro Gly Arg Ala Phe Tyr Thr Thr Gly Asp Ile Ile Gly
275 280 285

GAT ATA AGA CAA GCA CAT TGT AAC CTT AGT AAA ACA CAA 895
Asp Ile Arg Gln Ala His Cys Asn Leu Ser Lys Thr Gln
290 295

TGG GAA AAA ACG TTA AGA CAG ATA GCT ATA AAA TTA GAA 934
Trp Glu Lys Thr Leu Arg Gln Ile Ala Ile Lys Leu Glu
300 305 310

GAA AAA TTT AAG AAT AAA ACA ATA GCC TTT AAT AAA TCC 973
Glu Lys Phe Lys Asn Lys Thr Ile Ala Phe Asn Lys Ser
315 320

TCA GGA GGG GAC CCA GAA ATT GTA ATG CAC AGT TTT AAT 1012
Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn
325 330 335

TGT GGA GGG GAA TTT TTC TAC TGT AAT ACA ACA AAA CTG 1051
Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Lys Leu
340 345 350

TTT AAT AGT ACC TGG AAT TTA ACA CAA CCG TTT AGT AAT 1090
Phe Asn Ser Thr Trp Asn Leu Thr Gln Pro Phe Ser Asn
355 360

ACC GGG AAT CGT ACT GAA GAG TTA AAT ATT ACA CTC CCA 1129
Thr Gly Asn Arg Thr Glu Glu Leu Asn Ile Thr Leu Pro
365 370 375

TGC AGA ATA AAA CAA ATC ATA AAC TTG TGG CAG GAA GTA 1168
Cys Arg Ile Lys Gln Ile Ile Asn Leu Trp Gln Glu Val
380 385

GGC AAA GCA ATG TAT GCC CCT CCC ATC AGA GGA CAA ATT 1207
Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile
390 395 400

AGA TGT TCA TCA AAT ATT ACA GGG CTA CTA TTA ACA AGA 1246
Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg
405 410 415

GAT GGT GGA AGT AAC ACC GGT GAC AAC AGG ACT GAG ACC 1285
Asp Gly Gly Ser Asn Thr Gly Asp Asn Arg Thr Glu Thr
420 425

TTT AGA CCT GGA GGA GGA GAT ATG AGG GAC AAT TGG AGA 1324
Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg
430 435 440

AGT GAA TTA TAT AAA TAT AAA GTA GTA AGA ATT GAA CCA 1363
Ser Glu Leu Tyr Lys Tyr Lys Val Val Arg Ile Glu Pro
445 450

TTA GGA GTA GCA CCC ACC CAG GCA AAG AGA AGA GTG GTG 1402
Leu Gly Val Ala Pro Thr Gln Ala Lys Arg Arg Val Val
455 460 465

CAA AGA GAA AAA AGA GCA GTG GGG ATA GGA CCT ATG TTC 1441
Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Met Phe
470 475 480

CTT GGG TTC TTG GGA GAT AA 1461
Leu Gly Phe Leu Gly Asp
485 486

(2) INFORMATION FOR SEQ ID NO: 6:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 486 amino acids

(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:






Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser 
1 5 10 15 



Asp Ala Lys Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr 
20 25 30

His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu
35 40 45

Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val
50 55 60

Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
65 70 75

Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys
80 85 90

Thr Asn Leu Glu Asn Ala Asn Asn Thr Glu Asn Ala Asn Asn Thr
95 100 105

Asn Asn Tyr Thr Leu Gly Met Glu Arg Gly Glu Ile Lys Asn Cys
110 115 120

Ser Phe Asn Ile Thr Thr Ser Leu Arg Asp Lys Val Lys Lys Glu
125 130 135






Tyr Ala Leu Phe Tyr Lys Leu Asp Val Val Gln Ile Asp Asn Ser
140 145 150

Thr Asn Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln
155 160 165

Ala Cys Pro Lys Val Ser Phe Glu Leu Ile Pro Ile His Tyr Cys
170 175 180

Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys Phe
185 190 195

Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val Gln Cys Thr
200 205 210






His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly
215 220 225

Ser Leu Ala Glu Glu Glu Ile Val Ile Arg Ser Glu Asn Ile Thr
230 235 240

Asp Asn Ala Lys Thr Ile Ile Val Gln Leu Asn Glu Ser Ile Val
245 250 255

Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Asn
260 265 270

Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly Asp Ile Ile Gly
275 280 285

Asp Ile Arg Gln Ala His Cys Asn Leu Ser Lys Thr Gln Trp Glu
290 295 300

Lys Thr Leu Arg Gln Ile Ala Ile Lys Leu Glu Glu Lys Phe Lys
305 310 315

Asn Lys Thr Ile Ala Phe Asn Lys Ser Ser Gly Gly Asp Pro Glu
320 325 330

Ile Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
335 340 345

Asn Thr Thr Lys Leu Phe Asn Ser Thr Trp Asn Leu Thr Gln Pro
350 355 360

Phe Ser Asn Thr Gly Asn Arg Thr Glu Glu Leu Asn Ile Thr Leu
365 370 375

Pro Cys Arg Ile Lys Gln Ile Ile Asn Leu Trp Gln Glu Val Gly
380 385 390

Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile Arg Cys Ser
395 400 405

Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Ser Asn
410 415 420

Thr Gly Asp Asn Arg Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp
425 430 435

Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val
440 445 450

Arg Ile Glu Pro Leu Gly Val Ala Pro Thr Gln Ala Lys Arg Arg
455 460 465

Val Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Met Phe
470 475 480

Leu Gly Phe Leu Gly Asp
485 486

(2) INFORMATION FOR SEQ ID NO: 7:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1474 base pairs
(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

G GTA CCT GTG TGG AAA GAA GCA ACC ACC ACT CTA TTT 37
Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe
1 5 10

TGT GCA TCA GAT GCT AAA GCA TAT GAT ACA GAG GTA CAT 76
Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val His
15 20 25

AAT GTT TGG GCT ACA CAT GCC TGT GTA CCC ACA GAC CCC 115
Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro
30 35

AAC CCA CAA GAA GTA GTA TTG GAA AAT GTA ACA GAA AAT 154
Asn Pro Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn
40 45 50

TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAG ATG CAT 193
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His
55 60

GAG GAT ATA ATC AGT TTA TGG GAT CAA AGT CTA AAG CCA 232
Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro
65 70 75

TGT GTA AAA TTA ACC CCA CTC TGT GTT ACT TTA AAT TGC 271
Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys
80 85 90






ACT AAT TTG GAG AAT GCT AAT AAT ACC GAG AAT GCT AAT 310 
Thr Asn Leu Glu Asn Ala Asn Asn Thr Glu Asn Ala Asn 

95 100

AAT ACC AAT AAT TAT ACC TTG GGG ATG GAG AGA GGT GAA 349
Asn Thr Asn Asn Tyr Thr Leu Gly Met Glu Arg Gly Glu
105 110 115

AGA AAA AAC TGC TCT TTC AAT ATC ACC ACA AGC TTA AGA 388
Arg Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Leu Arg
120 125

GAT AAG GGG AAA AAA GAA TAT GCA TTG TTT TAT AAA CTT 427
Asp Lys Gly Lys Lys Glu Tyr Ala Leu Phe Tyr Lys Leu
130 135 140

GAT GTA GTA CAA ATA GAT AAT AGT ACC AAC TAT AGG CTG 466
Asp Val Val Gln Ile Asp Asn Ser Thr Asn Tyr Arg Leu
145 150 155

ATA AGT TGT AAT ACC TCA GTC ATT ACA CAC GCC TGT CCA 505
Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro
160 165






WO98/01564



PCT/US97/09690 






AAG GTA TGC TTT GAG CCA ATT CCC ATA CAT TAT TGT GCC 544
Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala
170 175 180

CCG GCT GGT TTT GCG ATT CTA AAG TGT AAA GAT AAG AAG 583
Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys
185 190

TTC AAT GGA ACA GGA CCA TGT AAA AAT GTC AGG ACA GTA 622
Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Arg Thr Val
195 200 205

CAA TGT ACA CAT GGA ATT AGA CCA GTA GTA TCA ACT CAA 661
Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln
210 215 220

CTA CTG TTA AAT GGC AGT CTA GCA GAA GAA GAG ATA GTA 700
Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Ile Val
225 230

ATT AGA TCT GAA AAT ATC ACA GAC AAT GCT AAA ACC ATA 739
Ile Arg Ser Glu Asn Ile Thr Asp Asn Ala Lys Thr Ile
235 240 245

ATA GTG CAG CTA AAT GAA TCT ATA GTG ATT AAT TGT ACA 778
Ile Val Gln Leu Asn Glu Ser Ile Val Ile Asn Cys Thr
250 255

AGA CCC AAT AAC AAC ACA AGA AAA AGT ATA AAT ATA GGA 817
Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Asn Ile Gly
260 265 270

CCA GGG AGA GCA TTC TAT ACA ACA GGA GAC ATA ATA GGA 856
Pro Gly Arg Ala Phe Tyr Thr Thr Gly Asp Ile Ile Gly
275 280 285

GAT ATA AGA CAA GCA CAT TGT AAC CTT AGT AAA ACA CAA 895
Asp Ile Arg Gln Ala His Cys Asn Leu Ser Lys Thr Gln
290 295

TGG GAA AAA ACG TTA AGA CAG ATA GCT ATA AAA TTA GAA 934
Trp Glu Lys Thr Leu Arg Gln Ile Ala Ile Lys Leu Glu
300 305 310

GAA AAA TTT AAG AAT AAA ACA ATA GCC TTT AAT AAA TCC 973
Glu Lys Phe Lys Asn Lys Thr Ile Ala Phe Asn Lys Ser
315 320

TCA GGA GGG GAC CCA GAA ATT GTA ATG CAC AGT TTT AAT 1012
Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn
325 330 335

TGT GGA GGG GGA TTT TTC TAC TGT AGT ACG AGA AAA GTG 1051
Cys Gly Gly Gly Phe Phe Tyr Cys Ser Thr Arg Lys Leu
340 345 350

TTT AAT AGT ACC TGG AAT TTA ACA CAA CCG TTT AGT AAT 1090
Phe Asn Ser Thr Trp Asn Leu Thr Gln Pro Phe Ser Asn
355 360

ACC GGG GAT CGT ACT GAA GAG TTA AAT ATT ACA CTC CCA 1129
Thr Gly Asp Arg Thr Glu Glu Leu Asn Ile Thr Leu Pro
365 370 375

TGC AGA ATA AAA CAA ATC ATA AAC TTG TGG GAG GAA GTA 1168
Cys Arg Ile Lys Gln Ile Ile Asn Leu Trp Gln Glu Val
380 385

GGC AAA GCA ATG TAT GCC CCT CCC ATC AGA GGA CAA ATT 1207
Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile
390 395 400

AGA TGT TCA TCA AAT ATT ACA GGG CTA CTA TTA AGG AGA 1246
Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Arg Arg
405 410 415

GAT GGT GGA AGT AAC ACC AGT GAC AAC CAG ACT GAG ACC 1285
Asp Gly Gly Ser Asn Thr Ser Asp Asn Gln Thr Glu Thr
420 425

TTT AGA CCT GGG GGA GGA GAT ATG AGG GAC AAG TGG AGA 1324
Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Lys Trp Arg
430 435 440

AGT GAA TTA TAT AAA TAT AAA GTA GTA AGA ATT GAA CCA 1363
Ser Glu Leu Tyr Lys Tyr Lys Val Val Arg Ile Glu Pro
445 450

TTA GGA GTA GCA CCC ACC CAG GCA AAG AGA AGA GTG GTG 1402
Leu Gly Val Ala Pro Thr Gln Ala Lys Arg Arg Val Val
455 460 465

CAA AGA GAA AAA AGA GCA GTG GGG ATA GGA GCT ATG TTC 1441
Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Met Phe
470 475 480

CTT AGG TTC TTA GGA GAT AAA GCT TCT ACA GTC 1474
Leu Arg Phe Leu Gly Asp Lys Ala Ser Arg Val
485 490 491

(2) INFORMATION FOR SEQ ID NO: 8:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 491 amino acids
(B) TYPE: Amino Acid

(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser 

1 5 10 15

Asp Ala Lys Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr
20 25 30

His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Val Val Leu
35 40 45

Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val
50 55 60

Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
65 70 75

Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys
80 85 90

Thr Asn Leu Glu Asn Ala Asn Asn Thr Glu Asn Ala Asn Asn Thr
95 100 105

Asn Asn Tyr Thr Leu Gly Met Glu Arg Gly Glu Arg Lys Asn Cys
110 115 120

Ser Phe Asn Ile Thr Thr Ser Leu Arg Asp Lys Gly Lys Lys Glu
125 130 135

Tyr Ala Leu Phe Tyr Lys Leu Asp Val Val Gln Ile Asp Asn Ser
140 145 150






Thr Asn Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln
155 160 165

Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys
170 175 180

Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys Phe
185 190 195

Asn Gly Thr Gly Pro Cys Lys Asn Val Arg Thr Val Gln Cys Thr
200 205 210

His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly
215 220 225

Ser Leu Ala Glu Glu Glu Ile Val Ile Arg Ser Glu Asn Ile Thr
230 235 240

Asp Asn Ala Lys Thr Ile Ile Val Gln Leu Asn Glu Ser Ile Val
245 250 255

Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Asn
260 265 270

Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly Asp Ile Ile Gly
275 280 285

Asp Ile Arg Gln Ala His Cys Asn Leu Ser Lys Thr Gln Trp Glu
290 295 300

Lys Thr Leu Arg Gln Ile Ala Ile Lys Leu Glu Glu Lys Phe Lys
305 310 315

Asn Lys Thr Ile Ala Phe Asn Lys Ser Ser Gly Gly Asp Pro Glu
320 325 330

Ile Val Met His Ser Phe Asn Cys Gly Gly Gly Phe Phe Tyr Cys
335 340 345

Ser Thr Arg Lys Leu Phe Asn Ser Thr Trp Asn Leu Thr Gln Pro
350 355 360

Phe Ser Asn Thr Gly Asp Arg Thr Glu Glu Leu Asn Ile Thr Leu
365 370 375

Pro Cys Arg Ile Lys Gln Ile Ile Asn Leu Trp Gln Glu Val Gly
380 385 390

Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile Arg Cys Ser
395 400 405

Ser Asn Ile Thr Gly Leu Leu Leu Arg Arg Asp Gly Gly Ser Asn
410 415 420

Thr Ser Asp Asn Gln Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp
425 430 435

Met Arg Asp Lys Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val
440 445 450

Arg Ile Glu Pro Leu Gly Val Ala Pro Thr Gln Ala Lys Arg Arg
455 460 465

Val Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Met Phe
470 475 480

Leu Arg Phe Leu Gly Asp Lys Ala Ser Arg Val
485 490 491

(2) INFORMATION FOR SEQ ID NO:9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1512 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:



CTC GAG GTA CCT GTA TGG AAA GAA GCA ACT ACC ACT 36 
Leu Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr 
1 5 10 



CTA TTT TGT GCA TCA GAT GCT AAA GCA TAT AAT ACA GAG 75
Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asn Thr Glu
15 20 25

AAA CAT AAT GTT TGG GCC ACA CAC GCC TGT GTA CCC ACA 114
Lys His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr
30 35

GAT CCC AAC CCA CAA GAA GTA GTA TTG GGA AAT GTG ACA 153
Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr
40 45 50

GAA AAT TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAA 192
Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln
55 60

ATG CAT GAA GAT ATA ATC AGT TTA TGG GAT CAA AGT CTA 231
Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
65 70 75

AAG CCA TGT GTA AAA TTA ACC CCA CTC TGT GTT ACT TTA 270
Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
80 85 90

AAT TGC ACT GAT GAT TTA GGG AAT GCT ACT AAT ACC AAT 309
Asn Cys Thr Asp Asp Leu Gly Asn Ala Thr Asn Thr Asn
95 100

AGT AGT GCC ACT ACC AAT AGT AGT AGT TGG GAA GAA ATG 348
Ser Ser Ala Thr Thr Asn Ser Ser Ser Trp Glu Glu Met
105 110 115

AAG GGG GAA ATG AAA AGA TGC TCT TTC AAT ATC ACC ACA 387
Lys Gly Glu Met Lys Arg Cys Ser Phe Asn Ile Thr Thr
120 125

AGC ATA AGA GAT AAG ATT AAG AAA GAA CAT GCA CTT TTC 426
Ser Ile Arg Asp Lys Ile Lys Lys Glu His Ala Leu Phe
130 135 140



TAT AGA CTT GAT GTA GTA CCA ATA GAT AAT GAT AAT ACC 465
Tyr Arg Leu Asp Val Val Pro Ile Asp Asn Asp Asn Thr
145 150 155

ACA TAT AGG TTG ATA AAT TGT AAT ACC TCA GTC ATT ACA 504
Thr Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Thr
160 165

CAG GCC TGT CCA AAG GTA TCA TTT GAG CCA ATT CCC ATA 543
Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile
170 175 180

CAT TTT TGT GCC CCG GCT GGT TTT GCG ATT CTA AAG TGT 582
His Phe Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys
185 190

AAT AAT AAG ACG TTC GAG GGA AAA GGA CCA TGT AAA AAT 621
Asn Asn Lys Thr Phe Glu Gly Lys Gly Pro Cys Lys Asn
195 200 205

GTC AGT ACA GTA CAA TGC ACA CAT GGA ATT AGG CCA GTA 660
Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val
210 215 220

GTG TCA ACT CAA CTG CTG TTA AAT GGC AGT CTA GCA GAA 699
Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu
225 230

GAA GAG GTA ATA ATT AGA TCT GAC AAT ATC ACA GAC AAT 738
Glu Glu Val Ile Ile Arg Ser Asp Asn Ile Thr Asp Asn
235 240 245

ACT AAA ACC ATT ATA GTA CAG CTA AAC GAA TCT GTA GTA 777
Thr Lys Thr Ile Ile Val Gln Leu Asn Glu Ser Val Val
250 255

ATT AAT TGT ACA AGA CCC AAC AAC AAT ACA AGA AAA AGT 816
Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser
260 265 270

ATA CAT ATA GGA CCA GGC AGT GCA TTT TTT GCA ACA GGA 855
Ile His Ile Gly Pro Gly Ser Ala Phe Phe Ala Thr Gly
275 280 285

GAA ATA ATA GGA GAT ATA AGA CAA GCA CAC TGT AAC CTT 894
Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Leu
290 295

AGT AGA ACA CAA TGG AAT AAC ACT TTA GGA AAG ATA GTC 933
Ser Arg Thr Gln Trp Asn Asn Thr Leu Gly Lys Ile Val
300 305 310

ATA AAA TTA AGA GAA CAA TTT AGA AAA CAA TTT GGA GAA 972
Ile Lys Leu Arg Glu Gln Phe Arg Lys Gln Phe Gly Glu
315 320

AAA ACA ATA GTC TTT AAT CGA TCC TCA GGA GGG GAC CCG 1011
Lys Thr Ile Val Phe Asn Arg Ser Ser Gly Gly Asp Pro
325 330 335

GAA ATT GCA ATG CAC AGT TTT AAT TGT GGA GGG GAA TTT 1050
Glu Ile Ala Met His Ser Phe Asn Cys Gly Gly Glu Phe
340 345 350

TTC TAC TGT AAC ACA ACA GCA CTG TTT AAT AGT ACC TGG 1089
Phe Tyr Cys Asn Thr Thr Ala Leu Phe Asn Ser Thr Trp
355 360

AAT GTT ACT AAA GGG TTG AAT AAC ACT GAA GGA AAT AGC 1128
Asn Val Thr Lys Gly Leu Asn Asn Thr Glu Gly Asn Ser
365 370 375

ACA GGA GAT GAA AAT ATC ATA CTC CCA TGT AGA ATA AAA 1167
Thr Gly Asp Glu Asn Ile Ile Leu Pro Cys Arg Ile Lys
380 385

CAA ATT ATA AAC ATG TGG CAG GAA GTA GGA AAA GCA ATG 1206
Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met
390 395 400






TAT GCC CCT CCC ATC AGT GGA CAA ATT AGA TGT TCA TCA 1245
Tyr Ala Pro Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser
405 410 415

AAC ATT ACA GGG CTG CTA CTA ACA AGA GAT GGT GGT AGT 1284
Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Ser
420 425

AAG AAC GAG AGC ATC ACC ACC GAG GTC TTC AGA CCT GGA 1323
Lys Asn Glu Ser Ile Thr Thr Glu Val Phe Arg Pro Gly
430 435 440

GGA GGA GAT ATG AGG GAC AAT TGG AGA AGT GAA TTA TAT 1362
Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr
445 450

AAA TAT AAA GTA GTA AAA ATT GAA CCA TTA GGA GTA GCG 1401
Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala
455 460 465

CCC ACC AAG GCA AAG AGA AGA GTG GTG CAG AGA GAA AAA 1440
Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys
470 475 480

AGA GCA GTG GGA ACA ATA GGA GCT ATG TTC CTT GGG TTC 1479
Arg Ala Val Gly Thr Ile Gly Ala Met Phe Leu Gly Phe
485 490

TTG GGA GCA TAA AGC TTC TAG AGT CGA CCT GCA 1512
Leu Gly Ala Xaa Ser Phe Xaa Ser Arg Pro Ala
495 500 504
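
The codon rows and residue rows of a listing such as SEQ ID NO:9 can be checked against each other mechanically. The following is a minimal sketch, not part of the original listing: it assumes the standard genetic code, and the names `translate`, `CODON_TO_ONE` and `ONE_TO_THREE` are hypothetical helpers introduced only for illustration. Stop codons are reported as `Xaa`, following the convention the listing itself uses for TAA and TAG.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_ONE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter residue names; "*" (stop) is written Xaa,
# mirroring how the listing prints stop codons.
ONE_TO_THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Xaa",
}


def translate(dna: str) -> list[str]:
    """Translate a codon row (spaces and stray characters ignored) to residues."""
    cleaned = "".join(ch for ch in dna.upper() if ch in "ACGT")
    usable = len(cleaned) - len(cleaned) % 3  # drop a trailing partial codon
    codons = [cleaned[i:i + 3] for i in range(0, usable, 3)]
    return [ONE_TO_THREE[CODON_TO_ONE[c]] for c in codons]


# First and last codon rows of SEQ ID NO:9 as printed in the listing.
first_row = translate("CTC GAG GTA CCT GTA TGG AAA GAA GCA ACT ACC ACT")
last_row = translate("TTG GGA GCA TAA AGC TTC TAG AGT CGA CCT GCA")
print(" ".join(first_row))
print(" ".join(last_row))
```

Comparing the printed residues with the residue row under each codon row is one way to spot transcription errors in either row.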

(2) INFORMATION FOR SEQ ID NO:10:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 504 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:



Leu Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys
1 5 10 15

Ala Ser Asp Ala Lys Ala Tyr Asn Thr Glu Lys His Asn Val Trp
20 25 30

Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Val
35 40 45

Val Leu Gly Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn
50 55 60

Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln
65 70 75

Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
80 85 90

Asn Cys Thr Asp Asp Leu Gly Asn Ala Thr Asn Thr Asn Ser Ser
95 100 105

Ala Thr Thr Asn Ser Ser Ser Trp Glu Glu Met Lys Gly Glu Met
110 115 120

Lys Arg Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp Lys Ile
125 130 135

Lys Lys Glu His Ala Leu Phe Tyr Arg Leu Asp Val Val Pro Ile
140 145 150

Asp Asn Asp Asn Thr Thr Tyr Arg Leu Ile Asn Cys Asn Thr Ser
155 160 165

Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro
170 175 180

Ile His Phe Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn
185 190 195

Asn Lys Thr Phe Glu Gly Lys Gly Pro Cys Lys Asn Val Ser Thr
200 205 210

Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu
215 220 225

Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Ile Ile Arg Ser
230 235 240

Asp Asn Ile Thr Asp Asn Thr Lys Thr Ile Ile Val Gln Leu Asn
245 250 255

Glu Ser Val Val Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg
260 265 270

Lys Ser Ile His Ile Gly Pro Gly Ser Ala Phe Phe Ala Thr Gly
275 280 285

Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Leu Ser Arg
290 295 300

Thr Gln Trp Asn Asn Thr Leu Gly Lys Ile Val Ile Lys Leu Arg
305 310 315

Glu Gln Phe Arg Lys Gln Phe Gly Glu Lys Thr Ile Val Phe Asn
320 325 330

Arg Ser Ser Gly Gly Asp Pro Glu Ile Ala Met His Ser Phe Asn
335 340 345

Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Ala Leu Phe Asn
350 355 360

Ser Thr Trp Asn Val Thr Lys Gly Leu Asn Asn Thr Glu Gly Asn
365 370 375

Ser Thr Gly Asp Glu Asn Ile Ile Leu Pro Cys Arg Ile Lys Gln
380 385 390

Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro
395 400 405

Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu
410 415 420

Leu Leu Thr Arg Asp Gly Gly Ser Lys Asn Glu Ser Ile Thr Thr
425 430 435

Glu Val Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg
440 445 450

Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly
455 460 465

Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys
470 475 480

Arg Ala Val Gly Thr Ile Gly Ala Met Phe Leu Gly Phe Leu Gly
485 490 495

Ala Xaa Ser Phe Xaa Ser Arg Pro Ala
500 504

(2) INFORMATION FOR SEQ ID NO:11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1501 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:



CTC GAG GTA CCT GTG TGG AAA GAA GCA ACT ACC ACT 36 
Leu Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr
1 5 10

CTA TTT TGT GCA TCA GAT GCT AAA GCA TAT AAT ACA GAG 75
Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asn Thr Glu
15 20 25

AAA CAT AAT GTT TGG GCC ACA CAC GCC TGT GTA CCC ACA 114
Lys His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr
30 35

GAT CCC AAC CCA CAA GAA GTA GTA TTG GGA AAT GTG ACA 153
Asp Pro Asn Pro Gln Glu Val Val Leu Gly Asn Val Thr
40 45 50

GAA AAT TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAA 192
Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln
55 60

ATG CAT GAA GAT ATA ATC AGT TTA TGG GAT CAA AGT CTA 231
Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
65 70 75

AAG CCA TGT GTA AAA TTA ACC CCA CTC TGT GTT ACT TTA 270
Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
80 85 90



AAT TGC ACT GAT GAT TTA GGG AAT GCT ACT AAT ACC AAT 309
Asn Cys Thr Asp Asp Leu Gly Asn Ala Thr Asn Thr Asn
95 100

AGC AGT GCC ACT ACC AAT AGT AGT AGT TGG GAA GAA ATG 348
Ser Ser Ala Thr Thr Asn Ser Ser Ser Trp Glu Glu Met
105 110 115

AAG GGG GAA ATG AAA AGG TGC TCT TTC AAT ATC ACC ACA 387
Lys Gly Glu Met Lys Arg Cys Ser Phe Asn Ile Thr Thr
120 125

AGC ATA AGA GAT AAG ATT AAG AAA GAA CAT GCA CTT TTC 426
Ser Ile Arg Asp Lys Ile Lys Lys Glu His Ala Leu Phe
130 135 140

TAT AGA CTT GAT GTA GTA CCA ATA GAT AAT GAT AAT ACC 465
Tyr Arg Leu Asp Val Val Pro Ile Asp Asn Asp Asn Thr
145 150 155

ACA TAT AGG TTG ATA AAT TGT AAT ACC TCA GTC ATT ACA 504
Thr Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Thr
160 165

CAG GCC TGT CCA AAG GTA TCA TTT GAG CCA ATT CCC ATA 543
Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile
170 175 180

CAT TTT TGT GCC CCG GCT GGT TTT GCG ATT CTA AAG TGT 582
His Phe Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys
185 190

AAT AAT AAG ACG TTC GAG GGA AAA GGA CCA TGT AAA AAT 621
Asn Asn Lys Thr Phe Glu Gly Lys Gly Pro Cys Lys Asn
195 200 205

GTC AGT ACA GTA CAA TGC ACA CAT GGA ATT AGG CCA GTA 660
Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val
210 215 220

GTG TCA ACT CAA CTG CTG TTA AAT GGC AGT CTA GCA GAA 699
Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu
225 230

GAA GAG GTA ATA ATT AGA TCT GGC AAT ATC ACA GAC AAT 738
Glu Glu Val Ile Ile Arg Ser Gly Asn Ile Thr Asp Asn
235 240 245

ACT AAA ACC ATT ATA GTA CAG CTA AAC GAA TCT GTA GTA 777
Thr Lys Thr Ile Ile Val Gln Leu Asn Glu Ser Val Val
250 255

ATT AAT TGT ACA AGA TCC AAC AAC AAT ACA AGA AAA AGT 816
Ile Asn Cys Thr Arg Ser Asn Asn Asn Thr Arg Lys Ser
260 265 270

ATA CAT ATA GGA CCA GGG AGT GCA TTT TTT GCA ACA GGA 855
Ile His Ile Gly Pro Gly Ser Ala Phe Phe Ala Thr Gly
275 280 285

GAA ATA ATA GGA GAT ATA AGA CAA GCA CAC TGT AAC CTT 894
Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Leu
290 295



AGT AGA ACA CAA TGG AAT AAC ACT TTA GGA AAG ATA GTC 933
Ser Arg Thr Gln Trp Asn Asn Thr Leu Gly Lys Ile Val
300 305 310

ATA AAA TTA AGA GAA CAA TTT AGA AAA CAA TTT GGA GAA 972
Ile Lys Leu Arg Glu Gln Phe Arg Lys Gln Phe Gly Glu
315 320

AAA ACA ATA GTC TTT AAT CGA TCC TCA GGA GGG GAC CCG 1011
Lys Thr Ile Val Phe Asn Arg Ser Ser Gly Gly Asp Pro
325 330 335

GAA ATT GCA ATG CAC AGT TTT AAT TGT GGA GGG GAA TTT 1050
Glu Ile Ala Met His Ser Phe Asn Cys Gly Gly Glu Phe
340 345 350

TTC TAC TGT AAC ACA ACA GCA CTG TTT AAT AGT ACC TGG 1089
Phe Tyr Cys Asn Thr Thr Ala Leu Phe Asn Ser Thr Trp
355 360






AAT GTT ACT AAA GGG TTG AAT AAC ACT GAA GGA AAT AGC 1128
Asn Val Thr Lys Gly Leu Asn Asn Thr Glu Gly Asn Ser
365 370 375

ACA GGG GAT GAA AAT ATC ATA CTC CCA TGT AGA ATA AAA 1167
Thr Gly Asp Glu Asn Ile Ile Leu Pro Cys Arg Ile Lys
380 385

CAA ATT ATA AAC ATG TGG CAG GAA GTA GGA AAA GCA ATG 1206
Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met
390 395 400

TAT GCC CCT CCC ATC AGT GGA CAA ATT AGA TGT TCA TCA 1245
Tyr Ala Pro Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser
405 410 415

AAT ATT ACA GGG CTG CTA CTA ACA AGA GAT GGT GGT AGT 1284
Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Ser
420 425

AAG AAC GAG AGC ATC ACC ACC GAG GTC TTC AGA CCT GGA 1323
Lys Asn Glu Ser Ile Thr Thr Glu Val Phe Arg Pro Gly
430 435 440

GGA GGA GAT ATG AGG GAC AAT TGG AGA AGT GAA TTA TAT 1362
Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr
445 450

AAA TAT AAA GTA GTA AAA ATT GAA CCA TTA GGA GTA GCG 1401
Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala
455 460 465

CCC ACC AAG GCA AAG AGA AGA GTG GTG CAG AGA GAA AAA 1440
Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys
470 475 480

AGA GCA GTG GGA ACA ATA GGA GCT ATG TTC CTT GGG TTC 1479
Arg Ala Val Gly Thr Ile Gly Ala Met Phe Leu Gly Phe
485 490

TTA GGA GCA TAA AGC TTC TAG A 1501
Leu Gly Ala Xaa Ser Phe Xaa
495 500



(2) INFORMATION FOR SEQ ID NO:12:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 500 amino acids
(B) TYPE: Amino Acid

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:

Leu Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys
1 5 10 15

Ala Ser Asp Ala Lys Ala Tyr Asn Thr Glu Lys His Asn Val Trp
20 25 30

Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Val
35 40 45

Val Leu Gly Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn
50 55 60

Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln
65 70 75

Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
80 85 90

Asn Cys Thr Asp Asp Leu Gly Asn Ala Thr Asn Thr Asn Ser Ser
95 100 105

Ala Thr Thr Asn Ser Ser Ser Trp Glu Glu Met Lys Gly Glu Met
110 115 120

Lys Arg Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp Lys Ile
125 130 135

Lys Lys Glu His Ala Leu Phe Tyr Arg Leu Asp Val Val Pro Ile
140 145 150

Asp Asn Asp Asn Thr Thr Tyr Arg Leu Ile Asn Cys Asn Thr Ser
155 160 165



Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro
170 175 180

Ile His Phe Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn
185 190 195

Asn Lys Thr Phe Glu Gly Lys Gly Pro Cys Lys Asn Val Ser Thr
200 205 210

Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu
215 220 225

Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Ile Ile Arg Ser
230 235 240

Gly Asn Ile Thr Asp Asn Thr Lys Thr Ile Ile Val Gln Leu Asn
245 250 255

Glu Ser Val Val Ile Asn Cys Thr Arg Ser Asn Asn Asn Thr Arg
260 265 270

Lys Ser Ile His Ile Gly Pro Gly Ser Ala Phe Phe Ala Thr Gly
275 280 285

Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Leu Ser Arg
290 295 300

Thr Gln Trp Asn Asn Thr Leu Gly Lys Ile Val Ile Lys Leu Arg
305 310 315

Glu Gln Phe Arg Lys Gln Phe Gly Glu Lys Thr Ile Val Phe Asn
320 325 330

Arg Ser Ser Gly Gly Asp Pro Glu Ile Ala Met His Ser Phe Asn
335 340 345

Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Ala Leu Phe Asn
350 355 360

Ser Thr Trp Asn Val Thr Lys Gly Leu Asn Asn Thr Glu Gly Asn
365 370 375

Ser Thr Gly Asp Glu Asn Ile Ile Leu Pro Cys Arg Ile Lys Gln
380 385 390

Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro
395 400 405

Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu
410 415 420

Leu Leu Thr Arg Asp Gly Gly Ser Lys Asn Glu Ser Ile Thr Thr
425 430 435

Glu Val Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg
440 445 450

Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly
455 460 465

Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys
470 475 480

Arg Ala Val Gly Thr Ile Gly Ala Met Phe Leu Gly Phe Leu Gly
485 490 495

Ala Xaa Ser Phe Xaa
500



(2) INFORMATION FOR SEQ ID NO:13:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1514 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:

GG GAA TTC GGA TCC GGG GTA CCT GTG TGG AAG GAA GCA 38
Glu Phe Gly Ser Gly Val Pro Val Trp Lys Glu Ala
1 5 10

ACC ACG ACT CTA TTC TGT GCA TCA GAT GCT AGA GCA TAT 77
Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Arg Ala Tyr
15 20 25

GAC ACA GAG GTA CAT AAT GTT TGG GCC ACA CAT GCC TGT 116
Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys
30 35



GTA CCC ACA GAC CCT AGT CCA CAA GAA GTA GTT TTG [illegible] 155
Val Pro Thr Asp Pro Ser Pro Gln Glu Val Val Leu Glu
40 45 50



AAT GTG ACA GAA AAT TTT AAC ATG TGG AAA AAT AAC ATG 194
Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn Met
55 60

GTA GAA CAA ATG CAT GAG GAT ATA ATT AGT TTA TGG GAT 233
Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp
65 70 75

CAA AGC TTA AAG CCA TGT GTA AAA TTA ACC CCA CTC TGT 272
Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys
80 85 90

GTT ACT TTA AAT TGC AGT GAT TAT AGG AAT GCT ACT GAT 311
Val Thr Leu Asn Cys Ser Asp Tyr Arg Asn Ala Thr Asp
95 100

TAT AAG AAT GCT ACT GAT ACC ACT AGT AGT AAC GAG GGA 350
Tyr Lys Asn Ala Thr Asp Thr Thr Ser Ser Asn Glu Gly
105 110 115

AAG ATG GAG AGA GGA GAA ATA AAA AAC TGC TCT TTC AAT 389
Lys Met Glu Arg Gly Glu Ile Lys Asn Cys Ser Phe Asn
120 125

ATT ACC ACA AGC ATA AAA AAT AAG ATG CAG AAA GAA TAT 428
Ile Thr Thr Ser Ile Lys Asn Lys Met Gln Lys Glu Tyr
130 135 140

GCA CTT TTC TAT AAA CTT GAT ATA GTA CCA ATA GAT AAT 467
Ala Leu Phe Tyr Lys Leu Asp Ile Val Pro Ile Asp Asn
145 150 155

ACA AGC TAT ACA TTG ATA AGT TGT AAC ACC TCA GTC ATT 506
Thr Ser Tyr Thr Leu Ile Ser Cys Asn Thr Ser Val Ile
160 165

ACA CAG GCC TGT CCA AAG GTA TCC TTT GAA CCA ACT CCC 545
Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Thr Pro
170 175 180

ATA CAT TAT TGT GCT CCG GCT GGT TTT GCG ATT CTA AAG 584
Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys
185 190

GTA GTA TCA ACT CAA CTC CTG TTA AAT GGC AGT CTA GCA 701
Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala
225 230



GAA GAA GAG GTG GTA ATT AGA TCT GAC AAT TTC ATA GAC 740
Glu Glu Glu Val Val Ile Arg Ser Asp Asn Phe Ile Asp
235 240 245

AAT ACT AAA ACC ATA ATA GTA CAG CTG AAA GAA TCT GTA 779
Asn Thr Lys Thr Ile Ile Val Gln Leu Lys Glu Ser Val
250 255

GAA ATT AAT TGT ATA AGA CCC AAC AAT AAT ACA AGA AAA 818
Glu Ile Asn Cys Ile Arg Pro Asn Asn Asn Thr Arg Lys
260 265 270

GGT ATA CAT ATA GGA CCA GGG AGA GCA TGG TAT GCA ACA 857
Gly Ile His Ile Gly Pro Gly Arg Ala Trp Tyr Ala Thr
275 280 285

GGA GAA ATA GTA GGA GAT ATA AGA AAG GCA TAT TGT AAC 896
Gly Glu Ile Val Gly Asp Ile Arg Lys Ala Tyr Cys Asn
290 295

ATT AGT AGA ACA AAA TGG AAT AAC ACT TTA ATA CAG ATA 935
Ile Ser Arg Thr Lys Trp Asn Asn Thr Leu Ile Gln Ile
300 305 310

GCT AAC AAA TTA AAA GAA AAA TAT AAT ACA ACA ATA AGC 974
Ala Asn Lys Leu Lys Glu Lys Tyr Asn Thr Thr Ile Ser
315 320

TTT AAT CGA TCC TCA GGA GGG GAC CCA GAA ATT GTA ACG 1013
Phe Asn Arg Ser Ser Gly Gly Asp Pro Glu Ile Val Thr
325 330 335

CAT AGT TTT AAT TGT GGA GGG GAG TTT TTC TAC TGT GAT 1052
His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asp
340 345 350

TCA ACA CAA CTG TTT AAT AGT ACT TGG AAT TTA AAT GGT 1091
Ser Thr Gln Leu Phe Asn Ser Thr Trp Asn Leu Asn Gly
355 360

ACT TGG AAT TTT ACT GCA GGG TCA AAT GAA ACT GAA GGC 1130
Thr Trp Asn Phe Thr Ala Gly Ser Asn Glu Thr Glu Gly
365 370 375

AAT ATC ACA CTC CCA TGC AGA ATA AAA CAA ATT ATA AAC 1169
Asn Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn
380 385

AGG TGG CAG GAA GTA GGG AAA GCA ATG TAT GCC CCT CCC 1208
Arg Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro
390 395 400

ATC AGT GGA CAA ATA AAA TGC TCA TCA AAC ATT ACA GGG 1247
Ile Ser Gly Gln Ile Lys Cys Ser Ser Asn Ile Thr Gly
405 410 415

ATG ATA TTA ACA AGG GAT GGT GGT AAC GAG AAC AAT AAT 1286
Met Ile Leu Thr Arg Asp Gly Gly Asn Glu Asn Asn Asn
420 425

GAG AGC AGT ACT ACT GAG ACC TTC AGA GGG GGA GGA GGA 1325
Glu Ser Ser Thr Thr Glu Thr Phe Arg Pro Gly Gly Gly
430 435 440

GAT ATG AGG AAC AAT TGG AGA AGT GAA TTA TAT AAA TAT 1364
Asp Met Arg Asn Asn Trp Arg Ser Glu Leu Tyr Lys Tyr
445 450

AAA GTA GTA AAA ATT GAA CCA TTA GGA GTA GCA CCC ACC 1403
Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr
455 460 465

AAG GCA AAG AGA AGA GTG GTG CAG AGA GAA AAA AGA GCA 1442
Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala
470 475 480

GTG GGA GCG CTA GGA GCT ATG TTC CTT GGG TTC TTA GGA 1481
Val Gly Ala Leu Gly Ala Met Phe Leu Gly Phe Leu Gly
485 490

GCA TAA AGC TTC TAG ACC GAC TCT AGA GGA TCC 1514
Ala Xaa Ser Phe Xaa Thr Asp Ser Arg Gly Ser
495 500 504

(2) INFORMATION FOR SEQ ID NO:14:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 504 amino acids
(B) TYPE: Amino Acid

(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:

Glu Phe Gly Ser Gly Val Pro Val Trp Lys Glu Ala Thr Thr Thr
1 5 10 15

Leu Phe Cys Ala Ser Asp Ala Arg Ala Tyr Asp Thr Glu Val His
20 25 30

Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Ser Pro
35 40 45

Gln Glu Val Val Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp
50 55 60

Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu
65 70 75

Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys
80 85 90

Val Thr Leu Asn Cys Ser Asp Tyr Arg Asn Ala Thr Asp Tyr Lys
95 100 105

Asn Ala Thr Asp Thr Thr Ser Ser Asn Glu Gly Lys Met Glu Arg
110 115 120

Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Lys
125 130 135




Asn Lys Met Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu Asp Ile
140 145 150

Val Pro Ile Asp Asn Thr Ser Tyr Thr Leu Ile Ser Cys Asn Thr
155 160 165

Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Thr
170 175 180

Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys
185 190 195

Asn Asp Lys Lys Phe Ser Gly Lys Gly Glu Cys Lys Asn Val Ser
200 205 210

Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln
215 220 225

Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg
230 235 240

Ser Asp Asn Phe Ile Asp Asn Thr Lys Thr Ile Ile Val Gln Leu
245 250 255

Lys Glu Ser Val Glu Ile Asn Cys Ile Arg Pro Asn Asn Asn Thr
260 265 270






Arg Lys Gly Ile His Ile Gly Pro Gly Arg Ala Trp Tyr Ala Thr
275 280 285

Gly Glu Ile Val Gly Asp Ile Arg Lys Ala Tyr Cys Asn Ile Ser
290 295 300

Arg Thr Lys Trp Asn Asn Thr Leu Ile Gln Ile Ala Asn Lys Leu
305 310 315

Lys Glu Lys Tyr Asn Thr Thr Ile Ser Phe Asn Arg Ser Ser Gly
320 325 330

Gly Asp Pro Glu Ile Val Thr His Ser Phe Asn Cys Gly Gly Glu
335 340 345

Phe Phe Tyr Cys Asp Ser Thr Gln Leu Phe Asn Ser Thr Trp Asn
350 355 360

Leu Asn Gly Thr Trp Asn Phe Thr Ala Gly Ser Asn Glu Thr Glu
365 370 375

Gly Asn Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Arg
380 385 390

Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Ser Gly
395 400 405

Gln Ile Lys Cys Ser Ser Asn Ile Thr Gly Met Ile Leu Thr Arg
410 415 420

Asp Gly Gly Asn Glu Asn Asn Asn Glu Ser Ser Thr Thr Glu Thr
425 430 435

Phe Arg Pro Gly Gly Gly Asp Met Arg Asn Asn Trp Arg Ser Glu
440 445 450

Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala
455 460 465

Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala
470 475 480

Val Gly Ala Leu Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Xaa
485 490 495

Ser Phe Xaa Thr Asp Ser Arg Gly Ser
500 504



(2) INFORMATION FOR SEQ ID NO:15:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1408 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:

G GTA CCT GTG TGG AAG GAA GCA ACC ACC ACT CTA TTC 37
Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe
1 5 10

TGT GCA TCA GAT GCT AGA GCA TAT GAC ACA GAG GTA CAT 76
Cys Ala Ser Asp Ala Arg Ala Tyr Asp Thr Glu Val His
15 20 25

AAT GTT TGG GCC ACA CAT GCC TGT GTA CCC ACA GAC CCT 115
Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro
30 35

AGT CCA CAA GAA GTA TTT TTG GGA AAT GTG ACA GAA AAT 154
Ser Pro Gln Glu Val Phe Leu Gly Asn Val Thr Glu Asn
40 45 50

TTT AAT ATG TGG AAA AAT AAC ATG GTA GAA CAA ATG TAT 193
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met Tyr
55 60

GAG GAT ATA ATT AGT TTA TGG GAT CAA AGC TTA AAG CCA 232
Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro
65 70 75

TGT GTA AAA TTA ACC CCA CTC TGT GTT ACT TTA AAT TGC 271
Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys
80 85 90

AGT GAT TAT AGG AAT GCT ACT GAT TAT AAG AAT GCT ACT 310
Ser Asp Tyr Arg Asn Ala Thr Asp Tyr Lys Asn Ala Thr
95 100

GAT ACC ACT AGT AGT AAC GAG GGA AAG ATG GAG AGA GGA 349
Asp Thr Thr Ser Ser Asn Glu Gly Lys Met Glu Arg Gly
105 110 115

GAA ATA AAA AAC TGC TCT TTC AAT ATC ACC ACA AGC ATA 388
Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile
120 125

AAA AAT AAG ATG CAG AAA GAA TAT GCA CTT TTC TAT AAA 427
Lys Asn Lys Met Gln Lys Glu Tyr Ala Leu Phe Tyr Lys
130 135 140

CTT AAT ATA GTA CCA ATA GAT AAT ACA AGC TAT ACA TTG 466
Leu Asn Ile Val Pro Ile Asp Asn Thr Ser Tyr Thr Leu
145 150 155

ATA AGT TGT AAC ACC TCA GTC ATT ACA CAG GCC TGT CCA 505
Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro
160 165

AAG GTA TCC TTT GAA CCA ATT CCC ATA CAT TAT TGT GCT 544
Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala
170 175 180

CCG GCT GGT TTT GCG ATT CTA AAG TGT AAT GAT AAG AAG 583
Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Lys
185 190



30 



35 



50 



55 



60 
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. TTC AGT GGA. AAA GGA GAA TGT : AAA : AAT CTC AGC ACA GTA 622 

: Phe Ser Gly Lys Gly Glu Cys Lys ; Asn ; Val Ser Thr Val 
•• 195 ■ • • 200y : : . 205 " " 

5 V 6A A TGT ACA CAT GG A ; ATT A6G ^ d A G T A I G T A TCA ACT CAA 661 
Gin Cys Thr His Gly lie ; Arg Pro Val Val Ser Thr Gin 
210. .•/•': : ;. : : : . 215 . 220 

CTG CTG TTA AAT GGC AGT CTA GCA GAA GAA GAG GTG GTA  700
Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val
                225                 230

ATT AGA TCT GAC AAT TTC ACA GAC AAT ACT AAA ACC ATA  739
Ile Arg Ser Asp Asn Phe Thr Asp Asn Thr Lys Thr Ile
    235                 240                 245

ATA GTA CAG CTG AAA GAA TCT GTA GAA ATT AAT TGT ATA  778
Ile Val Gln Leu Lys Glu Ser Val Glu Ile Asn Cys Ile
            250                 255

AGA CCC AAC AAT AAT ACA AGA AAA GGT ATA CAT ATA GGA  817
Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His Ile Gly
260                 265                 270

CCA GGG AGA GGA TGG TAT GCA ACA GGA GAA ATA GTA GGA  856
Pro Gly Arg Ala Trp Tyr Ala Thr Gly Glu Ile Val Gly
        275                 280                 285

GAT ATA AGA CAG GCA TAT TGT AAC ATT AGT AGA ACA AAA  895
Asp Ile Arg Gln Ala Tyr Cys Asn Ile Ser Arg Thr Lys
                290                 295

TGG AAT AAC ACT TTA ATA CAG ATA GCT AAC AAA TTA AAA  934
Trp Asn Asn Thr Leu Ile Gln Ile Ala Asn Lys Leu Lys
    300                 305                 310

GAA AAA TAT AAT ACA ACA ATA AGC TTT AAT CGA TCC TCA  973
Glu Lys Tyr Asn Thr Thr Ile Ser Phe Asn Arg Ser Ser
            315                 320

GGA GGG GAC CCA GAA ATT GTA ACC CAT AGT TTT AAT TGT  1012
Gly Gly Asp Pro Glu Ile Val Thr His Ser Phe Asn Cys
325                 330                 335

GGA GGG GAA TTT TTC TAC TGT AAT TCA ACA CAA CTG TTT  1051
Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln Leu Phe
        340                 345                 350

AAT AGT ACT TGG AAT TTA AAT GGT ACT TGG AAT TTT ACT  1090
Asn Ser Thr Trp Asn Leu Asn Gly Thr Trp Asn Phe Thr
                355                 360

GCA GGG TCA AAT GAA ACT GAA GGC AAT ATC ACA CTC CCA  1129
Ala Gly Ser Asn Glu Thr Glu Gly Asn Ile Thr Leu Pro
    365                 370                 375

TGC AGA ATA AAA CAA ATT ATA AAC AGG TGG CAG GAA GTA  1168
Cys Arg Ile Lys Gln Ile Ile Asn Arg Trp Gln Glu Val
            380                 385






GGA AAA GCA ATG TAT GCC CCT CCC ATC ACT GGA CAA ATA  1207
Gly Lys Ala Met Tyr Ala Pro Pro Ile Ser Gly Gln Ile
390                 395                 400

AGA TGC TCA TCA AAC ATT ACA GGG ATG ATA TTA ACA AGG  1246
Arg Cys Ser Ser Asn Ile Thr Gly Met Ile Leu Thr Arg
        405                 410                 415

GAT GGT GGT AAC GAG AAcT AAT AAT GAG AGC AGT ACT ACT  1285
Asp Gly Gly Asn Glu Asn Asn Asn Glu Ser Ser Thr Thr
                420                 425

GAG ACG TTC AGA CCG GGA GGA GGA GAT ATG AGG AAC AAT  1324
Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asn Asn
    430                 435                 440

TGG AGA AGT GAA TTA TAT AAA TAT AAA GTA GTA AAA ATT  1363
Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile
            445                 450

GAG CCA TTA GGA GTA GCA CCC ACC GAC TCT AGA GGA TCC  1402
Glu Pro Leu Gly Val Ala Pro Thr Asp Ser Arg Gly Ser
455                 460                 465

TCT AGA  1408
Ser Arg
    469

(2) INFORMATION FOR SEQ ID NO: 16:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 469 amino acids
    (B) TYPE: Amino Acid
    (D) TOPOLOGY: Linear
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser
1               5                   10                  15

Asp Ala Arg Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr
                20                  25                  30

His Ala Cys Val Pro Thr Asp Pro Ser Pro Gln Glu Val Phe Leu
                35                  40                  45

Gly Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val
                50                  55                  60

Glu Gln Met Tyr Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
                65                  70                  75

Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys
                80                  85                  90

Ser Asp Tyr Arg Asn Ala Thr Asp Tyr Lys Asn Ala Thr Asp Thr
                95                  100                 105

Thr Ser Ser Asn Glu Gly Lys Met Glu Arg Gly Glu Ile Lys Asn
                110                 115                 120

Cys Ser Phe Asn Ile Thr Thr Ser Ile Lys Asn Lys Met Gln Lys
                125                 130                 135

Glu Tyr Ala Leu Phe Tyr Lys Leu Asn Ile Val Pro Ile Asp Asn
                140                 145                 150

Thr Ser Tyr Thr Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln
                155                 160                 165

Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys
                170                 175                 180

Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Lys Phe
                185                 190                 195

Ser Gly Lys Gly Glu Cys Lys Asn Val Ser Thr Val Gln Cys Thr
                200                 205                 210

His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly
                215                 220                 225

Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Asp Asn Phe Thr
                230                 235                 240

Asp Asn Thr Lys Thr Ile Ile Val Gln Leu Lys Glu Ser Val Glu
                245                 250                 255

Ile Asn Cys Ile Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His
                260                 265                 270

Ile Gly Pro Gly Arg Ala Trp Tyr Ala Thr Gly Glu Ile Val Gly
                275                 280                 285

Asp Ile Arg Gln Ala Tyr Cys Asn Ile Ser Arg Thr Lys Trp Asn
                290                 295                 300

Asn Thr Leu Ile Gln Ile Ala Asn Lys Leu Lys Glu Lys Tyr Asn
                305                 310                 315

Thr Thr Ile Ser Phe Asn Arg Ser Ser Gly Gly Asp Pro Glu Ile
                320                 325                 330

Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn
                335                 340                 345

Ser Thr Gln Leu Phe Asn Ser Thr Trp Asn Leu Asn Gly Thr Trp
                350                 355                 360

Asn Phe Thr Ala Gly Ser Asn Glu Thr Glu Gly Asn Ile Thr Leu
                365                 370                 375

Pro Cys Arg Ile Lys Gln Ile Ile Asn Arg Trp Gln Glu Val Gly
                380                 385                 390

Lys Ala Met Tyr Ala Pro Pro Ile Ser Gly Gln Ile Arg Cys Ser
                395                 400                 405

Ser Asn Ile Thr Gly Met Ile Leu Thr Arg Asp Gly Gly Asn Glu
                410                 415                 420

Asn Asn Asn Glu Ser Ser Thr Thr Glu Thr Phe Arg Pro Gly Gly
                425                 430                 435

Gly Asp Met Arg Asn Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys
                440                 445                 450

Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Asp Ser Arg
                455                 460                 465

Gly Ser Ser Arg
            469



(2) INFORMATION FOR SEQ ID NO: 17:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 1499 base pairs
    (B) TYPE: Nucleic Acid
    (C) STRANDEDNESS: Single
    (D) TOPOLOGY: Linear
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

GAG GTA CCT GTG TGG AAA GAA GCA ACC ACT ACT CTA  36
Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu
1               5                   10

TTT TGT GCA TCA GAT GCT AAA GCA TAT GAC ACA GGG GTG  75
Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Gly Val
        15                  20                  25

CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA CCC ACA GAC  114
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp
                30                  35

CCC AAC CCA CAA GAA ATA GAA TTG GTA AAT GTG ACA GAA  153
Pro Asn Pro Gln Glu Ile Glu Leu Val Asn Val Thr Glu
    40                  45                  50

GAT TTT AAC ATG TGG AAA AAT AAA ATG GTA GAC CAG ATG  192
Asp Phe Asn Met Trp Lys Asn Lys Met Val Asp Gln Met
            55                  60

CAT GAG GAT ATA ATC AGT TTA TGG GAT GAA AGC CTA AAG  231
His Glu Asp Ile Ile Ser Leu Trp Asp Glu Ser Leu Lys
65                  70                  75

CCA TGT GTA AAG TTA ACC CCA CTT TGT GTT ACT CTA AAC  270
Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn
        80                  85                  90

TGC AGT GAT GTG AAC AAT TCC ACA AAT CCT AAT GAT ACT  309
Cys Ser Asp Val Asn Asn Ser Thr Asn Pro Asn Asp Thr
                95                  100

AAT ACT AAT TCC ACT AAT ACT ACT TCC TCT ACT CCT ACG  348
Asn Thr Asn Ser Thr Asn Thr Thr Ser Ser Thr Pro Thr
    105                 110                 115

GCC ACT ACT AGT AGC GAG GAA AAG ATG GAG AAG GGA GAA  387
Ala Thr Thr Ser Ser Glu Glu Lys Met Glu Lys Gly Glu
            120                 125

ATA AAA AAC TGC TCT TTC AAT ATC ACC ACA CAC ATG AAA  426
Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr His Met Lys
130                 135                 140

GAT AAG GCA CAG AAA GAA TAT GCA CTT TTT TAT AAA CTT  465
Asp Lys Ala Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu
        145                 150                 155

GAT ATA GTA CCA ATA GAT GAT AAT AAT GCC ACC TAT AGG  504
Asp Ile Val Pro Ile Asp Asp Asn Asn Ala Ser Tyr Arg
                160                 165

TTG ATA AGT TGT AAT ACC TCA GAC ATT ACA CAG GCC TGT  543
Leu Ile Ser Cys Asn Thr Ser Asp Ile Thr Gln Ala Cys
    170                 175                 180

CCA AAG GTG ACC TTT GAG CCA ATT CCC ATA CAT TAT TGT  582
Pro Lys Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys
            185                 190

GCC CCG GCT GGT TTT GCG ATT CTA AAG TGT AAA GAT AAG  621
Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys
195                 200                 205

AAG TTC AAT GGA ACA GGA CCA TGT TCA AAG GTC AGC ACA  660
Lys Phe Asn Gly Thr Gly Pro Cys Ser Lys Val Ser Thr
        210                 215                 220

GTA CAA TGT ACA CAT GGA ATT AGG CCA GTA GTA TCA ACT  699
Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
                225                 230

CAA CTG TTG TTA AAT GGC AGT CTT GCA GAA GAA GAA GTA  738
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val
    235                 240                 245

GTA ATT AGA TCT GTC AAT TTC ACA GAC AAT GCT AAA ATC  777
Val Ile Arg Ser Val Asn Phe Thr Asp Asn Ala Lys Ile
            250                 255


ATA ATA GTA CAG CTG AAA GAA CCT GTA CCA ATT AAT TGT  816
Ile Ile Val Gln Leu Lys Glu Pro Val Ala Ile Asn Cys
260                 265                 270

ACA AGA CCC AAC AAC AAT ACA AGA AAA GGT ATA CAT CTA  855
Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His Leu
        275                 280                 285

GGA CCA GGG AGC ACA TTT TAT ACA ACA GGA GAA ATA ATA  894
Gly Pro Gly Ser Thr Phe Tyr Thr Thr Gly Glu Ile Ile
                290                 295

GGA GAC ATA AGA AAA GCA TAT TGC AAG ATT AGT AAA GAA  933
Gly Asp Ile Arg Lys Ala Tyr Cys Lys Ile Ser Lys Glu
    300                 305                 310

AAA TGG AAT AAC ACT TTA AGA CAG GTA GTT AAA AAA TTA  972
Lys Trp Asn Asn Thr Leu Arg Gln Val Val Lys Lys Leu
            315                 320

AGA GAA CAA TTT GGG AAT AAA ACA ATA ATT TTT AAT CGA  1011
Arg Glu Gln Phe Gly Asn Lys Thr Ile Ile Phe Asn Arg
325                 330                 335

TCC TCA GGA GGG GAC CCA GAA ATT GTA ATG CAC AGT TTT  1050
Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe
        340                 345                 350

AAC TGT GGA GGG GAG TTT TTC TAC TGT AAT ACA ACA CAA  1089
Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Gln
                355                 360

CTG TTT AAT AGT ACT TGG AAT AAT ACT GAA GGG ACA AAT  1128
Leu Phe Asn Ser Thr Trp Asn Asn Thr Glu Gly Thr Asn
    365                 370                 375

AGC ACT GAA GGA AAT AGC ACA ATC ACA CTC CCA TGC AGA  1167
Ser Thr Glu Gly Asn Ser Thr Ile Thr Leu Pro Cys Arg
            380                 385

ATA AAA CAA ATT ATA AAT ATG TGG CAG GAA GTA GGA AAA  1206
Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys
390                 395                 400

GCA AGG TAT GCC GCT CCC ATC AGA GGA CGA ATT AGA TGC  1245
Ala Thr Tyr Ala Pro Pro Ile Arg Gly Arg Ile Arg Cys
        405                 410                 415

ATA TCA AAT ATT ACA GGA CTG CTA TTA ACA AGA GAT GGT  1284
Ile Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly
                420                 425

GGT AGG AAT GTC ACA AAC AAT ACC GAA ACC TTC AGA CCT  1323
Gly Arg Asn Val Thr Asn Asn Thr Glu Thr Phe Arg Pro
    430                 435                 440

GGA GGA GGA GAC ATG AGG GAC AAT TGG AGA AGT GAA TTA  1362
Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu
            445                 450

TAT AAA TAT AAA GTA GTA AAA GTT GAA CCA TTA GGA ATA  1401
Tyr Lys Tyr Lys Val Val Lys Val Glu Pro Leu Gly Ile
455                 460                 465

GCA CCC ACC AAG GCA AAG AGA AGA GTG GTG CAC AGA GAC  1440
Ala Pro Thr Lys Ala Lys Arg Arg Val Val His Arg Asp
        470                 475                 480

AAA AGA GCA GCA CTA GGA GCC TTG TTC CTT GGG TTC TTA  1479
Lys Arg Ala Ala Leu Gly Ala Leu Phe Leu Gly Phe Leu
                485                 490

GGA GCA TAA AAG CTT CTA GA  1499
Gly Ala Xaa Lys Leu Leu
    495             499

(2) INFORMATION FOR SEQ ID NO: 18:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 499 amino acids
    (B) TYPE: Amino Acid
    (D) TOPOLOGY: Linear
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala
1               5                   10                  15

Ser Asp Ala Lys Ala Tyr Asp Thr Gly Val His Asn Val Trp Ala
                20                  25                  30

Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Ile Glu
                35                  40                  45

Leu Val Asn Val Thr Glu Asp Phe Asn Met Trp Lys Asn Lys Met
                50                  55                  60

Val Asp Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Glu Ser
                65                  70                  75

Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn
                80                  85                  90

Cys Ser Asp Val Asn Asn Ser Thr Asn Pro Asn Asp Thr Asn Thr
                95                  100                 105

Asn Ser Thr Asn Thr Thr Ser Ser Thr Pro Thr Ala Thr Thr Ser
                110                 115                 120

Ser Glu Glu Lys Met Glu Lys Gly Glu Ile Lys Asn Cys Ser Phe
                125                 130                 135

Asn Ile Thr Thr His Met Lys Asp Lys Ala Gln Lys Glu Tyr Ala
                140                 145                 150

Leu Phe Tyr Lys Leu Asp Ile Val Pro Ile Asp Asp Asn Asn Ala
                155                 160                 165

Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Asp Ile Thr Gln Ala
                170                 175                 180

Cys Pro Lys Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys Ala
                185                 190                 195

Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys Phe Asn
                200                 205                 210

Gly Thr Gly Pro Cys Ser Lys Val Ser Thr Val Gln Cys Thr His
                215                 220                 225

Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser
                230                 235                 240

Leu Ala Glu Glu Glu Val Val Ile Arg Ser Val Asn Phe Thr Asp
                245                 250                 255

Asn Ala Lys Ile Ile Ile Val Gln Leu Lys Glu Pro Val Ala Ile
                260                 265                 270

Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His Leu
                275                 280                 285

Gly Pro Gly Ser Thr Phe Tyr Thr Thr Gly Glu Ile Ile Gly Asp
                290                 295                 300

Ile Arg Lys Ala Tyr Cys Lys Ile Ser Lys Glu Lys Trp Asn Asn
                305                 310                 315

Thr Leu Arg Gln Val Val Lys Lys Leu Arg Glu Gln Phe Gly Asn
                320                 325                 330

Lys Thr Ile Ile Phe Asn Arg Ser Ser Gly Gly Asp Pro Glu Ile
                335                 340                 345

Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn
                350                 355                 360

Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Asn Thr Glu Gly Thr
                365                 370                 375

Asn Ser Thr Glu Gly Asn Ser Thr Ile Thr Leu Pro Cys Arg Ile
                380                 385                 390

Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Thr Tyr
                395                 400                 405

Ala Pro Pro Ile Arg Gly Arg Ile Arg Cys Ile Ser Asn Ile Thr
                410                 415                 420

Gly Leu Leu Leu Thr Arg Asp Gly Gly Arg Asn Val Thr Asn Asn
                425                 430                 435

Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
                440                 445                 450

Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Val Glu Pro Leu
                455                 460                 465

Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg Val Val His Arg Asp
                470                 475                 480

Lys Arg Ala Ala Leu Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala
                485                 490                 495

Xaa Lys Leu Leu
            499

(2) INFORMATION FOR SEQ ID NO: 19:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 1499 base pairs
    (B) TYPE: Nucleic Acid
    (C) STRANDEDNESS: Single
    (D) TOPOLOGY: Linear
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

GAG GTA CCT GTA TGG AAA GAA GCA ACC ACT ACT CTA  36
Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu
1               5                   10

TTT TGT GCA TCA GAT GCT AAA GCA TAT GAC ACA GAG GTG  75
Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val
        15                  20                  25

CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA CCC ACA GAC  114
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp
                30                  35

CCC AAC CCA CAA GAA ATA GAA TTG GTA AAT GTG ACA GAA  153
Pro Asn Pro Gln Glu Ile Glu Leu Val Asn Val Thr Glu
    40                  45                  50

GAT TTT AAC ATG TGG AAA AAT AAA ATG GTA GAC CAG ATG  192
Asp Phe Asn Met Trp Lys Asn Lys Met Val Asp Gln Met
            55                  60

CAT GAG GAT ATA ATC AGT TTA TGG GAT GAA AGC CTA AAG  231
His Glu Asp Ile Ile Ser Leu Trp Asp Glu Ser Leu Lys
65                  70                  75

CCA TGT GTA AAG TTA ACC GGA CTT TGT GTT ACT GTA AAC  270
Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn
        80                  85                  90

TGC AGT GAT GTG AAC AAT TCC ACA AAT CCT AAT GAT ACT  309
Cys Ser Asp Val Asn Asn Ser Thr Asn Pro Asn Asp Thr
                95                  100

AAT ACT AAT TCC ACT AAT ACT ACT TCC TCT ACT CCT ACG  348
Asn Thr Asn Ser Thr Asn Thr Thr Ser Ser Thr Pro Thr
    105                 110                 115

GCC ACT ACT AGT AGC GAG GAA AAG ATG GAG AAG GGA GAA  387
Ala Thr Thr Ser Ser Glu Glu Lys Met Glu Lys Gly Glu
            120                 125

ATA AAA AAC TGC TCT TTC AAT ATC ACC ACA CAC ATG AAA  426
Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr His Met Lys
130                 135                 140

GAT AAG GTA CAG AAA GAA TAT GCA CTT TTT TAT AAA CTT  465
Asp Lys Val Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu
        145                 150                 155

GAT ATA GTA CCA ATA GAT GAT AAT AAT ACC AGC TAT AGG  504
Asp Ile Val Pro Ile Asp Asp Asn Asn Thr Ser Tyr Arg
                160                 165

TTG ATA AGT TGT AAT ACC TCA GTC ATT AGA CAG GCC TGT  543
Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys
    170                 175                 180

CCA ATG GTG ACC TTT GAG CCA ATT CCC ATA CAT TAT TGT  582
Pro Met Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys
            185                 190

GCC CCG GCT GGT TTT GCG ATT CTA AAG TGT AAA GAT AAG  621
Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys
195                 200                 205

AAG TTC AAT GGA ACA GGA CCA TGT TCA AAG GTC AGC ACA  660
Lys Phe Asn Gly Thr Gly Pro Cys Ser Lys Val Ser Thr
        210                 215                 220

GTA CAA TGT ACA CAT GGA ATT AGG CCA GTA GTA TCA ACT  699
Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
                225                 230

CAA CTG TTG TTA AAT GGC AGT CTT GCA GAA GAA GAA GTA  738
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val
    235                 240                 245

GTA ATT AGA TCT GTC AAT TTC ACA CAC AAT GCT AAA ATC  777
Val Ile Arg Ser Val Asn Phe Thr Asp Asn Ala Lys Ile
            250                 255

ATA ATA GTA CAG CTG AAA GAA CCT GTA GCA ATT AAT TGT  816
Ile Ile Val Gln Leu Lys Glu Pro Val Ala Ile Asn Cys
260                 265                 270

ACA AGA CCC AAC AAC AAT ACA AGA AAA GCT ATA CAT CTA  855
Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His Leu
        275                 280                 285

GGA CCA GGG AGC ACA TTT TAT ACA ACA GGA GAA ATA ATA  894
Gly Pro Gly Ser Thr Phe Tyr Thr Thr Gly Glu Ile Ile
                290                 295

GGA GAC ATA AGA AAA GCA TAT TGC AAG ATT AGT AAA GAA  933
Gly Asp Ile Arg Lys Ala Tyr Cys Lys Ile Ser Lys Glu
    300                 305                 310

AAA TGG AAT AAC ACT TTA AGA CAG GTA GTT AAA AAA TTA  972
Lys Trp Asn Asn Thr Leu Arg Gln Val Val Lys Lys Leu
            315                 320

AGA GAA CAA TTT GGG AAT AAA ACA ATA ATT TTT AAT CGA  1011
Arg Glu Gln Phe Gly Asn Lys Thr Ile Ile Phe Asn Arg
325                 330                 335

TCC TCA GGA GGG GAC CCA GAA ATT GTA ATG CAC AGT TTT  1050
Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe
        340                 345                 350

AAC TGT GGA GGG GAG TTT TTC TAC TGT AAT ACA ACA CAA  1089
Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Gln
                355                 360
CTG TTT AAT AGT ACT TGG AAT AAT ACT GAA GGG ACA AAT  1128
Leu Phe Asn Ser Thr Trp Asn Asn Thr Glu Gly Thr Asn
    365                 370                 375

AGC ACT GAA GGA AAT AGC ACA ATC ACA CTC CCA TGC AGA  1167
Ser Thr Glu Gly Asn Ser Thr Ile Thr Leu Pro Cys Arg
            380                 385

ATA AAA GAA ATT ATA AAT ATG TGG CAG GAA GTA GGA AAA  1206
Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys
390                 395                 400

GCA ACG TAT GCC CCT CCC ATC AGA GGA CGA ATT AGA TGC  1245
Ala Thr Tyr Ala Pro Pro Ile Arg Gly Arg Ile Arg Cys
        405                 410                 415

ATA TCA AAT ATT ACA GGA CTG CTA TTA ACA AGA GAT GGT  1284
Ile Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly
                420                 425

GGT AGG AAT GTC ACA AAC AAT ACC GAN NCC TTC AGA CCT  1323
Gly Arg Asn Val Thr Asn Asn Thr Xaa Xaa Phe Arg Pro
    430                 435                 440

GGA GGA GGA GAG ATG AGG GAC AAT TGG AGA AGT GAA TTA  1362
Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu
            445                 450

TAT AAA TAT AAA GTA GTA AAA GTT GAA CCA TTA GGA ATA  1401
Tyr Lys Tyr Lys Val Val Lys Val Glu Pro Leu Gly Ile
455                 460                 465

GCA CCC ACC AAG GCA AAG AGA AGA GTG GTG CAC AGA GAC  1440
Ala Pro Thr Lys Ala Lys Arg Arg Val Val His Arg Asp
        470                 475                 480

AAA AGA GCA GCA CTA GGA GCT TTG TTC CTT GGG TTC TTA  1479
Lys Arg Ala Ala Leu Gly Ala Leu Phe Leu Gly Phe Leu
                485                 490

GGA GCA TAA AAG CTT CTA GA  1499
Gly Ala Xaa Lys Leu Leu
    495             499
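
The nucleotide listings pair each codon with a three-letter amino acid code. As a minimal sketch for spot-checking such a pairing (it is not part of the original listing; the helper name `translate_codons` is hypothetical, and the standard genetic code is assumed), the first line of SEQ ID NO: 19 can be translated as follows. Stop codons and codons containing an ambiguous base such as N are reported as Xaa, which is how the listing itself prints them.

```python
from itertools import product

# Standard genetic code, written in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# One-letter to three-letter codes; "*" (stop) is shown as Xaa, as in the listing.
THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Xaa",
}

# Map every three-base codon to its three-letter amino acid code.
CODON_TABLE = {
    "".join(codon): THREE[aa]
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_codons(line):
    """Translate a whitespace-separated line of codons (hypothetical helper)."""
    codons = line.upper().split()
    # Codons not in the table (for example, those containing N) become Xaa.
    return " ".join(CODON_TABLE.get(codon, "Xaa") for codon in codons)


first_line = "GAG GTA CCT GTA TGG AAA GAA GCA ACC ACT ACT CTA"
print(translate_codons(first_line))
# prints: Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu
```

Running the same helper on any other codon line gives a quick consistency check between a nucleotide line and the amino acid line printed beneath it.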






(2) INFORMATION FOR SEQ ID NO: 20:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 499 amino acids
    (B) TYPE: Amino Acid
    (D) TOPOLOGY: Linear
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:



Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala
1               5                   10                  15

Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala
                20                  25                  30

Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Ile Glu
                35                  40                  45

Leu Val Asn Val Thr Glu Asp Phe Asn Met Trp Lys Asn Lys Met
                50                  55                  60

Val Asp Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Glu Ser
                65                  70                  75

Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn
                80                  85                  90

Cys Ser Asp Val Asn Asn Ser Thr Asn Pro Asn Asp Thr Asn Thr
                95                  100                 105

Asn Ser Thr Asn Thr Thr Ser Ser Thr Pro Thr Ala Thr Thr Ser
                110                 115                 120

Ser Glu Glu Lys Met Glu Lys Gly Glu Ile Lys Asn Cys Ser Phe
                125                 130                 135



Asn Ile Thr Thr His Met Lys Asp Lys Val Gln Lys Glu Tyr Ala
                140                 145                 150

Leu Phe Tyr Lys Leu Asp Ile Val Pro Ile Asp Asp Asn Asn Thr
                155                 160                 165

Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala
                170                 175                 180

Cys Pro Met Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys Ala
                185                 190                 195

Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys Phe Asn
                200                 205                 210

Gly Thr Gly Pro Cys Ser Lys Val Ser Thr Val Gln Cys Thr His
                215                 220                 225

Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser
                230                 235                 240

Leu Ala Glu Glu Glu Val Val Ile Arg Ser Val Asn Phe Thr Asp
                245                 250                 255

Asn Ala Lys Ile Ile Ile Val Gln Leu Lys Glu Pro Val Ala Ile
                260                 265                 270

Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His Leu
                275                 280                 285

Gly Pro Gly Ser Thr Phe Tyr Thr Thr Gly Glu Ile Ile Gly Asp
                290                 295                 300

Ile Arg Lys Ala Tyr Cys Lys Ile Ser Lys Glu Lys Trp Asn Asn
                305                 310                 315

Thr Leu Arg Gln Val Val Lys Lys Leu Arg Glu Gln Phe Gly Asn
                320                 325                 330

Lys Thr Ile Ile Phe Asn Arg Ser Ser Gly Gly Asp Pro Glu Ile
                335                 340                 345

Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn
                350                 355                 360

Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Asn Thr Glu Gly Thr
                365                 370                 375

Asn Ser Thr Glu Gly Asn Ser Thr Ile Thr Leu Pro Cys Arg Ile
                380                 385                 390



Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Thr Tyr
                395                 400                 405

Ala Pro Pro Ile Arg Gly Arg Ile Arg Cys Ile Ser Asn Ile Thr
                410                 415                 420

Gly Leu Leu Leu Thr Arg Asp Gly Gly Arg Asn Val Thr Asn Asn
                425                 430                 435

Thr Xaa Xaa Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
                440                 445                 450

Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Val Glu Pro Leu
                455                 460                 465

Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg Val Val His Arg Asp
                470                 475                 480

Lys Arg Ala Ala Leu Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala
                485                 490                 495

Xaa Lys Leu Leu
            499

(2) INFORMATION FOR SEQ ID NO: 21:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 1475 base pairs
    (B) TYPE: Nucleic Acid
    (C) STRANDEDNESS: Single
    (D) TOPOLOGY: Linear
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

G GTA CCT GTG TGG AAA GAA GCA AAC ACA ACT CTA TTT  37
  Val Pro Val Trp Lys Glu Ala Asn Thr Thr Leu Phe
  1               5                   10

TGT GCA TCA GAT GCT AAA GCA TAT GAT AGA GAA GTA CAT  76
Cys Ala Ser Asp Ala Lys Ala Tyr Asp Arg Glu Val His
        15                  20                  25

AAT GTT TGG GCA ACA CAT GCC TGT GTA CCC ACA GAC CCC  115
Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro
                30                  35

AAC CCA CAA GAA ATA GTA TTG GGA AAT GTG ACA GAA AAT  154
Asn Pro Gln Glu Ile Val Leu Gly Asn Val Thr Glu Asn
    40                  45                  50

TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAA ATG CAT  193
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His
            55                  60

GAG GAT ATA ATC AAT TTA TGG GAT CAA AGC TTA AAG CCA  232
Glu Asp Ile Ile Asn Leu Trp Asp Gln Ser Leu Lys Pro
65                  70                  75

TGT GTA AAG TTA ACT CCA CTC TGT GTT ACT TTA AAG TGC  271
Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Lys Cys
        80                  85                  90

AAG GAT CTG GAG AGG AAT ACT ACC TAT AAT AGC ACT ATT  310
Lys Asp Leu Glu Arg Asn Thr Thr Tyr Asn Ser Thr Ile
                95                  100

ACC AAT AAT AGT AGT TTG GAG GGA CTA AGA GAA CAA ATG  349
Thr Asn Asn Ser Ser Leu Glu Gly Leu Arg Glu Gln Met
    105                 110                 115

ACA AAC TGC TCT TTC AAC ATC ACC ACA AGT ATA AGA GAT  388
Thr Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp
            120                 125

AAG GTG CAG AAA GAA TAT GCA CTT TTG TAT AAA CTT GAT  427
Lys Val Gln Lys Glu Tyr Ala Leu Leu Tyr Lys Leu Asp
130                 135                 140

GTA GTA CGA ATA GAA GAA GAT GAC AAT ACT AGC TAT AGA  466
Val Val Pro Ile Glu Glu Asp Asp Asn Thr Ser Tyr Arg
        145                 150                 155

TTG ATA AGT TGT AAC ACC TCA GTC ATT ACA CAG GCT TGT  505
Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys
                160                 165



CCA AAG ACA TCC TTT GAG CCA ATT CCC ATA CAT TAT TGT  544
Pro Lys Thr Ser Phe Glu Pro Ile Pro Ile His Tyr Cys
    170                 175                 180

GCC CCG GCT GGT TTT GCG ATT CTA AAG TGT AAT GAT AAG  583
Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys
            185                 190

AAG TTC AAT GGA ACA GGA CCA TGT AAA AAT CTC ACC ACA  622
Lys Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr
195                 200                 205

GTA CAA TCT ACA CAT GGA ATT AGC CCA GTA GTA TCA ACT  661
Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
        210                 215                 220

CAA CTG TTG TTA AAT GGC AGT CTA GCA GAA GAA CAG GTA  700
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val
                225                 230

GTA ATC AGA TCT GCC AAT TTC ACA GAC AAT GCT AAA ACC  739
Val Ile Arg Ser Ala Asn Phe Thr Asp Asn Ala Lys Thr
    235                 240                 245

ATA ATA GTA CAT CTA AAT GAA ACT GTA AAA ATT AAT TGT  778
Ile Ile Val His Leu Asn Glu Thr Val Lys Ile Asn Cys
            250                 255

ACA AGA CTT GGC AAC AAT ACA AGA AAA AGT ATA AAT ATA  817
Thr Arg Leu Gly Asn Asn Thr Arg Lys Ser Ile Asn Ile
260                 265                 270

GGA CCA GGG AGA GTA CTC TAT GCA ACA GGA GAA ATA ATA  856
Gly Pro Gly Arg Val Leu Tyr Ala Thr Gly Glu Ile Ile
        275                 280                 285

GGA GAC ATA AGA CAA GCA CAT TGT AAC ATT AGT AGA GCA  895
Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Ala
                290                 295

CAA TGG AAT AAG ACT TTA GAA AAG GTA GTT GAC AAA TTA  934
Gln Trp Asn Lys Thr Leu Glu Lys Val Val Asp Lys Leu
    300                 305                 310



AGA AAA CAA TTT GGG GAT AAT ACA ACA ATA GCT TTT AAT  973
Arg Lys Gln Phe Gly Asp Asn Thr Thr Ile Ala Phe Asn
            315                 320

CGA TCC TCA GGA GGG GAC CCA GAA ATT GTA ATG CAC ACT  1012
Arg Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Thr
325                 330                 335

TTT AAT TGT GGA GGG GAA TTT TTC TAC TGT AAT ACA ACA  1051
Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr
        340                 345                 350

CAA CTG TTT AAT AGT ACT TGG AAT AAT ACT TGG AAG GAT  1090
Gln Leu Phe Asn Ser Thr Trp Asn Asn Thr Trp Lys Asp
                355                 360

CCT AAC AGG AGT GAC AAT ATC ACA CTC CCA TGC AGA ATA  1129
Pro Asn Arg Ser Asp Asn Ile Thr Leu Pro Cys Arg Ile
    365                 370                 375

AAA CAA ATT ATA AAC ATG TGG CAG GAA GTA GGA AAA GCA  1168
Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala
            380                 385

ATG TAC GCC CCT CCC ATC AGA GGG GAA ATT AGA TGT TCA  1207
Met Tyr Ala Pro Pro Ile Arg Gly Glu Ile Arg Cys Ser
390                 395                 400

TCA AAT ATC ACA GGG CTG CTA CTA ACA AGA GAT GGT GGT  1246
Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly
        405                 410                 415

AAT GAC GAT GGT AAT GAC ACG ACC ACA AAC AGG ACC GAG  1285
Asn Asp Asp Gly Asn Asp Thr Thr Thr Asn Arg Thr Glu
                420                 425

ATC TTC AGA CCT GGA GGA GGA GAT ATG AGG GAC AAT TGG  1324
Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
    430                 435                 440

AGA AGT GAA TTA TAT AGA TAT AAA GTA GTA AAA ATT GAA  1363
Arg Ser Glu Leu Tyr Arg Tyr Lys Val Val Lys Ile Glu
            445                 450

CCA TTA GGA ATA GCA CCC ACC AGG GCA AAG AGA AGA CTG  1402
Pro Leu Gly Ile Ala Pro Thr Arg Ala Lys Arg Arg Val
455                 460                 465

GTG CAG AGA GAA AAA AGA GCA GTA GGA CTA GGA GCT TTG  1441
Val Gln Arg Glu Lys Arg Ala Val Gly Leu Gly Ala Leu
        470                 475                 480

TTC CTT GGG T TCTTAGGAG CATAAAGCTT CTAGA  1475
Phe Leu Gly
        483

(2) INFORMATION FOR SEQ ID NO: 22:
  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 491 amino acids
    (B) TYPE: Amino Acid
    (D) TOPOLOGY: Linear
  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:



Val Pro Val Trp Lys Glu Ala Asn Thr Thr Leu Phe Cys Ala Ser
1               5                   10                  15

Asp Ala Lys Ala Tyr Asp Arg Glu Val His Asn Val Trp Ala Thr
                20                  25                  30

His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Ile Val Leu
                35                  40                  45

Gly Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val
                50                  55                  60

Glu Gln Met His Glu Asp Ile Ile Asn Leu Trp Asp Gln Ser Leu
                65                  70                  75

Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Lys Cys
                80                  85                  90

Lys Asp Leu Glu Arg Asn Thr Thr Tyr Asn Ser Thr Ile Thr Asn
                95                  100                 105

Asn Ser Ser Leu Glu Gly Leu Arg Glu Gln Met Thr Asn Cys Ser
                110                 115                 120

Phe Asn Ile Thr Thr Ser Ile Arg Asp Lys Val Gln Lys Glu Tyr
                125                 130                 135

Ala Leu Leu Tyr Lys Leu Asp Val Val Pro Ile Glu Glu Asp Asp
                140                 145                 150

Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr
                155                 160                 165

Gln Ala Cys Pro Lys Thr Ser Phe Glu Pro Ile Pro Ile His Tyr
                170                 175                 180

30 ■ \ • . ' • , :., 

.Cys Ala Pro Ala Gly Phe Ala lie Leu Lys Cys Asn Asp Lys Lys 

185 190 . .' 195 



Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val Gin Cys 
35 200 .205 210 

Thr His Gly l ie Arg Pro Val Val Ser Thr Gin -Leu Leu Leu Asn 
215 220 : 225 

40 : Gly Ser Leu Ala Glu Glu Giu Val Val lie Arg Ser Ala Asn Phe 

230 • 235 ;: 240 

Thr:: Asp Asn Ala Lys Thr lie lie Val His Leu Ash Giu Thr Val 

24.5 - . : 2.50 255 " 

4 5 : ..V,- ■ - . . . • . , 7 :'.':: W:'.:.> : 

Lys I le.:Asn Cys Thr Arg Leu Gly Asn Ash Thr Arg Lys Ser \ He : 
260 265 ' r 270 ' 

Asn lie Gly Pro Giy Arg Val Leu Tyr Ala Thr Gly Giu lie lie 
50 275 280 285 

■ Gly Asp lie Arg Gin Ala His Cys Asn He Ser .. Arg Ala Gin Trp 

2 90 • ' 295 300 ' 

55 Asn Lys: Thr Leu Glu Lys Val Val Asp Lys Leu : Arg Lys Gin Phe 

■ 305 ' 310 ''" : 315 

Gly Asp Asn Thr Thr He Ala Phe Asn Arg Ser Ser Gly Gly Asp 

■■ 320 .. ... . . 325 330 . 

60 ' ' '• .' : :' . .. 

Pro Glu lie Val Met His Thr Phe Asn Cys Gly Gly Glu Phe Phe 

.... 335 340 • .34 5 • 

Tyr Cys Asn Thr Thr Gin Leu Phe : Asn Ser Thr Trp Ash Asn Thr 
65 350 355 • • 360 • 








Trp Lys Asp Pro Asn Arg Ser Asp Asn Ile Thr Leu Pro Cys Arg
                365                 370                 375

Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met
                380                 385                 390

Tyr Ala Pro Pro Ile Arg Gly Glu Ile Arg Cys Ser Ser Asn Ile
                395                 400                 405

Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Asp Asp Gly Asn
                410                 415                 420

Asp Thr Thr Thr Asn Arg Thr Glu Ile Phe Arg Pro Gly Gly Gly
                425                 430                 435

Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Arg Tyr Lys Val
                440                 445                 450

Val Lys Ile Glu Pro Leu Gly Ile Ala Pro Thr Arg Ala Lys Arg
                455                 460                 465

Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Leu Gly Ala Leu
                470                 475                 480

Phe Leu Gly Phe Leu Gly Ala Leu Phe Leu Gly
                485                 490 491














(2) INFORMATION FOR SEQ ID NO: 23:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 1475 base pairs
          (B) TYPE: Nucleic Acid
          (C) STRANDEDNESS: Single
          (D) TOPOLOGY: Linear
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

G GTA CCT GTG TGG AAA GAA GCA AAC ACA ACT CTA TTT 37
  Val Pro Val Trp Lys Glu Ala Asn Thr Thr Leu Phe
  1               5                   10

TGT GCA TCA GAT GCT AAA GCA TAT GAT AGA GAA GTA CAT 76
Cys Ala Ser Asp Ala Lys Ala Tyr Asp Arg Glu Val His
        15                  20                  25

AAT GTT TGG GCA ACA CAT GCC TGT GTA CCC ACA GAC CCC 115
Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro
                30                  35

AAC CCA CAA GAA ATA GTA TTG GGA AAT GTG ACA GAA AAT 154
Asn Pro Gln Glu Ile Val Leu Gly Asn Val Thr Glu Asn
    40                  45                  50

TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAA ATG CAT 193
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His
            55                  60

GAG GAT ATA ATC AAT TTA TGG GAT CAA AGC TTA AAG CCA 232
Glu Asp Ile Ile Asn Leu Trp Asp Gln Ser Leu Lys Pro
65                  70                  75

TGT GTA AAG TTA ACT CCA CTC TGT ... ... ... ... TGC 271
Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Lys Cys
        80                  85                  90

AAG GAT CTG GAG AGG AAT ACT ACC TAT AAT AGC ACT ATT 310
Lys Asp Leu Glu Arg Asn Thr Thr Tyr Asn Ser Thr Ile
                95                  100

AGG AAT AAT AGT AGT TTG GAG GGA CTA AGA GAA CAA ATG 349
Thr Asn Asn Ser Ser Leu Glu Gly Leu Arg Glu Gln Met
    105                 110                 115

ACA AAC TGC TCT TTC AAC ATC ACC ACA AGT ATA AGA GAT 388
Thr Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Arg Asp
            120                 125

AAG GTG CAG AAA GAA TAT GCA CTT TTG TAT AAA CTT GAT 427
Lys Val Gln Lys Glu Tyr Ala Leu Leu Tyr Lys Leu Asp
130                 135                 140

GTA GTA CCA ATA GAA GAA GAT GAC AAT ACT ACC TAT AGA 466
Val Val Pro Ile Glu Glu Asp Asp Asn Thr Ser Tyr Arg
        145                 150                 155

TTG ATA AGT TGT AAC ACC TCA GTC ATT ACA CAG GCT TGT 505
Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys
                160                 165

CCA AAG ACA TCC TTT GAG CCA ATT CCC ATA CAT TAT TGT 544
Pro Lys Thr Ser Phe Glu Pro Ile Pro Ile His Tyr Cys
    170                 175                 180

GCC CCG GCT GGT TTT GCG ATT CTA AAG TGT AAT GAT AAG 583
Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys
            185                 190

AAG TTC AAT GGA ACA GGA CCA TGT AAA AAT GTC AGC ACA 622
Lys Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr
195                 200                 205

GTA CAA TGT ACA CAT GGA ATT AGG CCA GTA GTA TCA ACT 661
Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
        210                 215                 220

CAA CTG TTG TTA AAT GGC AGT CTA GCA GAA GAA GAG GTA 700
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val
                225                 230

GTA ATC AGA TCT GCC AAT TTC ACA GAC AAT GCT AAA ACC 739
Val Ile Arg Ser Ala Asn Phe Thr Asp Asn Ala Lys Thr
    235                 240                 245

ATA ATA GTA CAT CTA AAT GAA ACT GTA AAA ATT AAT TGT 778
Ile Ile Val His Leu Asn Glu Thr Val Lys Ile Asn Cys
            250                 255

ACA AGA CTT GGC AAC AAT ACA AGA AAA ACT ATA AAT ATA 817
Thr Arg Leu Gly Asn Asn Thr Arg Lys Ser Ile Asn Ile
260                 265                 270

GGA CCA GGG AGA GTA CTC TAT GCA ACA GGA GAA ATA ATA 856
Gly Pro Gly Arg Val Leu Tyr Ala Thr Gly Glu Ile Ile
        275                 280                 285

GGA GAC ATA AGA CAA GCA CAT TGT AAC ATT AGT AGA GCA 895
Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Ala
                290                 295

CAA TGG AAT AAG ACT TTA GAA AAG GTA GTT GAC AAG TTA 934
Gln Trp Asn Lys Thr Leu Glu Lys Val Val Asp Lys Leu
    300                 305                 310

AGA AAA CAA TTT GGG GAT AAT ACA ACA ATA GCT TTT AAT 973
Arg Lys Gln Phe Gly Asp Asn Thr Thr Ile Ala Phe Asn
            315                 320

CGA TCC TCA GGA GGG GAC CCA GAA ATT GTA ATG CAC ACT 1012
Arg Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Thr
325                 330                 335

TTT AAT TGT GGA GGG GAA TTT TTC TAC TGT AAT ACA ACA 1051
Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr
        340                 345                 350

CAA CTG TTT AAT AGT ACT TGG AAT AAT ACT TGG AAG GAT 1090
Gln Leu Phe Asn Ser Thr Trp Asn Asn Thr Trp Lys Asp
                355                 360

CCT AAC AGG AGT GAC AAT ATC ACA CTC CCA TGC AGA ATA 1129
Pro Asn Arg Ser Asp Asn Ile Thr Leu Pro Cys Arg Ile
    365                 370                 375

AAA CAA ATT ATA AAC ATG TGG CAG GAA GTA GGA AAA GCA 1168
Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala
            380                 385

ATG TAC GCC CCT CCC ATC AGA GGG GAA ATT AGA TGT TCA 1207
Met Tyr Ala Pro Pro Ile Arg Gly Glu Ile Arg Cys Ser
390                 395                 400

TCA AAT ATC ACA GGG CTG CTA CTA ACA AGA GAT GGT GGT 1246
Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly
        405                 410                 415

AAT GAC GAT GGT AAT GAC ACG ACC ACA AAC AGG ACC GAG 1285
Asn Asp Asp Gly Asn Asp Thr Thr Thr Asn Arg Thr Glu
                420                 425

ATC TTC AGA CCT GGA GGA GGA GAT ATG AGG GAC AAT TGG 1324
Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
    430                 435                 440

AGA AGT GAA TTA TAT AGA TAT AAA GTA GTA AAA ATT GAA 1363
Arg Ser Glu Leu Tyr Arg Tyr Lys Val Val Lys Ile Glu
            445                 450

CCA TTA GGA ATA GCA CCC ACC AGG GCA AAG AGA AGA GTG 1402
Pro Leu Gly Ile Ala Pro Thr Arg Ala Lys Arg Arg Val
455                 460                 465

GTG CAG AGA GAA AAA AGA GCA GTA GGA CTA GGA GCT TTG 1441
Val Gln Arg Glu Lys Arg Ala Val Gly Leu Gly Ala Leu
        470                 475                 480

TTC CTT GGG TTC TTG GGA GCA TAA AGC TTC TAG A 1475
Phe Leu Gly Phe Leu Gly Ala Xaa Ser Phe Xaa
                485                 490 491

60- . . 

(2) INFORMATION FOR SEQ ID NO: 24:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 491 amino acids
          (B) TYPE: Amino Acid
          (D) TOPOLOGY: Linear
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

Val Pro Val Trp Lys Glu Ala Asn Thr Thr Leu Phe Cys Ala Ser
1               5                   10                  15

Asp Ala Lys Ala Tyr Asp Arg Glu Val His Asn Val Trp Ala Thr
                20                  25                  30

His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Ile Val Leu
                35                  40                  45

Gly Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val
                50                  55                  60

Glu Gln Met His Glu Asp Ile Ile Asn Leu Trp Asp Gln Ser Leu
                65                  70                  75

Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Lys Cys
                80                  85                  90

Lys Asp Leu Glu Arg Asn Thr Thr Tyr Asn Ser Thr Ile Thr Asn
                95                  100                 105

Asn Ser Ser Leu Glu Gly Leu Arg Glu Gln Met Thr Asn Cys Ser
                110                 115                 120

Phe Asn Ile Thr Thr Ser Ile Arg Asp Lys Val Gln Lys Glu Tyr
                125                 130                 135

Ala Leu Leu Tyr Lys Leu Asp Val Val Pro Ile Glu Glu Asp Asp
                140                 145                 150

Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr
                155                 160                 165

Gln Ala Cys Pro Lys Thr Ser Phe Glu Pro Ile Pro Ile His Tyr
                170                 175                 180

Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Lys
                185                 190                 195

Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val Gln Cys
                200                 205                 210

Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn
                215                 220                 225

Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Ala Asn Phe
                230                 235                 240

Thr Asp Asn Ala Lys Thr Ile Ile Val His Leu Asn Glu Thr Val
                245                 250                 255

Lys Ile Asn Cys Thr Arg Leu Gly Asn Asn Thr Arg Lys Ser Ile
                260                 265                 270

Asn Ile Gly Pro Gly Arg Val Leu Tyr Ala Thr Gly Glu Ile Ile
                275                 280                 285

Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Ala Gln Trp
                290                 295                 300

Asn Lys Thr Leu Glu Lys Val Val Asp Lys Leu Arg Lys Gln Phe
                305                 310                 315

Gly Asp Asn Thr Thr Ile Ala Phe Asn Arg Ser Ser Gly Gly Asp
                320                 325                 330

Pro Glu Ile Val Met His Thr Phe Asn Cys Gly Gly Glu Phe Phe
                335                 340                 345

Tyr Cys Asn Thr Thr Gln Leu Phe Asn Ser Thr Trp Asn Asn Thr
                350                 355                 360

Trp Lys Asp Pro Asn Arg Ser Asp Asn Ile Thr Leu Pro Cys Arg
                365                 370                 375

Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met
                380                 385                 390

Tyr Ala Pro Pro Ile Arg Gly Glu Ile Arg Cys Ser Ser Asn Ile
                395                 400                 405

Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Asp Asp Gly Asn
                410                 415                 420

Asp Thr Thr Thr Asn Arg Thr Glu Ile Phe Arg Pro Gly Gly Gly
                425                 430                 435

Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Arg Tyr Lys Val
                440                 445                 450

Val Lys Ile Glu Pro Leu Gly Ile Ala Pro Thr Arg Ala Lys Arg
                455                 460                 465

Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Leu Gly Ala Leu
                470                 475                 480

Phe Leu Gly Phe Leu Gly Ala Xaa Ser Phe Xaa
                485                 490 491



(2) INFORMATION FOR SEQ ID NO: 25:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 1435 base pairs
          (B) TYPE: Nucleic Acid
          (C) STRANDEDNESS: Single
          (D) TOPOLOGY: Linear
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

CTC GAG GTA CCT GTG TGG AAA GAA GCA ACC ACC ACT 36
Leu Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr
1               5                   10

CTA TTT TGT GCA TCA GAT GCT AAA GCA TAT GAT TCA GAG 75
Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Ser Glu
        15                  20                  25

GCA CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA CCC ACA 114
Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr
                30                  35

GAC CCC AAC CCA CAA GAA GTA GAA TTG GAA AAT GTG ACA 153
Asp Pro Asn Pro Gln Glu Val Glu Leu Glu Asn Val Thr
    40                  45                  50

GAA AAT TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAG 192
Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln
            55                  60

ATG CAT GGG GAT ATA ATT AGT TTA TGG GAT CAA AGC CTA 231
Met His Gly Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
65                  70                  75

AAG CCA TGT GTA AAA TTA ACC CCA CTC TGT GTT ACG TTA 270
Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
        80                  85                  90

AAT TGC ACT GAC CCA AAT GTT ACT AAT AGC GAG AGA ACG 309
Asn Cys Thr Asp Pro Asn Val Thr Asn Ser Glu Arg Thr
                95                  100

ATA GAG GGG GGA GAA ATA AAA AAT TGC TCT TTC AAT ATC 348
Ile Glu Gly Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile
    105                 110                 115

ACC ACA AAC ATA AGA GAT AGG TTT CAG AAA GAA TAT GCA 387
Thr Thr Asn Ile Arg Asp Arg Phe Gln Lys Glu Tyr Ala
            120                 125

CTT TTT TAT AAA CTT GAT GTA ATA CCA TTA GGT AAT GAT 426
Leu Phe Tyr Lys Leu Asp Val Ile Pro Leu Gly Asn Asp
130                 135                 140

AAT ACT AGC TAT AGG TTG ATA AGT TGT AAC ACC TCA GTC 465
Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val
        145                 150                 155

ATT ACA CAG GCC TGT CCA AAG GTA TCC TTT GAG CCA ATT 504
Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile
                160                 165

CCC ATA CAT TAT TGT GCC CCG GCT GGT TTT GCG ATT CTA 543
Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu
    170                 175                 180

AAG TGT AAA GAT AAG AAG TTC AAT GGA ACA GGA CCA TGT 582
Lys Cys Lys Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys
            185                 190

ACA AAT GTC AGC ACA GTA CAA TGT ACA CAT GGA ATT AAG 621
Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys
195                 200                 205

CCA GTA GTA TCA ACT CAA CTG TTG TTA AAT GGC AGT CTA 660
Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu
        210                 215                 220

GCA GAA GAA GAC ATA GTA ATT AGA TCC GCC AAT CTC ACA 699
Ala Glu Glu Asp Ile Val Ile Arg Ser Ala Asn Leu Thr
                225                 230

GAC AAT GCT AAA AAC ATA ATA GTA CAG CTG AAT GAA TCT 738
Asp Asn Ala Lys Asn Ile Ile Val Gln Leu Asn Glu Ser
    235                 240                 245

GTA ACA ATG AAT TGT ACA AGA CCC AAC AAC AAT ACA ATG 777
Val Thr Met Asn Cys Thr Arg Pro Asn Asn Asn Thr Met
            250                 255

AAA AGT ATA CAT ATA GGA CCA GGC AGA GCA TTT TAT GCA 816
Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala
260                 265                 270

ACA GGA AAC ATA ATA GGA GAT ATA AGA CAA GCA CAT TGT 855
Thr Gly Asn Ile Ile Gly Asp Ile Arg Gln Ala His Cys
        275                 280                 285

AAC ATT AGT GGA ACA AAA TGG AAT GAC ACT TTG AAA AAG 894
Asn Ile Ser Gly Thr Lys Trp Asn Asp Thr Leu Lys Lys
                290                 295

ATA GCT ATA AAA TTA AGA GAA CAA TTT AAT AAG ACA ATA 933
Ile Ala Ile Lys Leu Arg Glu Gln Phe Asn Lys Thr Ile
    300                 305                 310

GTC TTT AAT CAA TCC TCA GGA GGG GAC CCA GAA ATT GCA 972
Val Phe Asn Gln Ser Ser Gly Gly Asp Pro Glu Ile Ala
            315                 320

ACG CTC AGT TTT AAT TGT GGA GGG GAA TTT TTC TAC TGT 1011
Thr Leu Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
325                 330                 335

AAT TCA ACA CAA CTG TTT AAT AGT ACT TGG AAT AGT ACT 1050
Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Asn Ser Thr
        340                 345                 350

GGG TCA AAT AAC ACT AAA GGA AAT GAC ACA ATC ACA CTC 1089
Gly Ser Asn Asn Thr Lys Gly Asn Asp Thr Ile Thr Leu
                355                 360

CCA TGC AGA ATA AGA CAA ATT ATA AAC ATG TGG CAG AAA 1128
Pro Cys Arg Ile Arg Gln Ile Ile Asn Met Trp Gln Lys
    365                 370                 375

ATA GGA AAA GCA ATG TAT GCC CCT CCC ATC AAA GGG CAA 1167
Ile Gly Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Gln
            380                 385

ATT AGA TGT TCA TCA AAT ATT ACA GGG CTA ATA TTA ACA 1206
Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Ile Leu Thr
390                 395                 400

AGA GAT GGT GGT AAC AAC AAC ATG AGC AAG ACC ACC GAG 1245
Arg Asp Gly Gly Asn Asn Asn Met Ser Lys Thr Thr Glu
        405                 410                 415

ACC TTC AGA CCT GGA GGA GGA GAT ATG AGG GAC AAT TGG 1284
Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
                420                 425

AGA AGT GAA TTA TAT AAA TAT AAA GTA GTA AAA ATT GAA 1323
Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu
    430                 435                 440

CCA TTA GGA GTA GCA CCC ACC AGG GCA AAG AGA AGA GTC 1362
Pro Leu Gly Val Ala Pro Thr Arg Ala Lys Arg Arg Val
            445                 450

GTG CAG AGA GAA AAA AGA GCA GTG GGA ATA GGA GCT GTG 1401
Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val
455                 460                 465

TTC CTT GGG TTC TTG GGA GCA TAA AGC TTC TAG A 1435
Phe Leu Gly Phe Leu Gly Ala Xaa Ser Phe Xaa
        470                 475         478



(2) INFORMATION FOR SEQ ID NO: 26:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 478 amino acids
          (B) TYPE: Amino Acid
          (D) TOPOLOGY: Linear
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

Leu Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys
1               5                   10                  15

Ala Ser Asp Ala Lys Ala Tyr Asp Ser Glu Ala His Asn Val Trp
                20                  25                  30

Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Val
                35                  40                  45

Glu Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn
                50                  55                  60

Met Val Glu Gln Met His Gly Asp Ile Ile Ser Leu Trp Asp Gln
                65                  70                  75

Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
                80                  85                  90

Asn Cys Thr Asp Pro Asn Val Thr Asn Ser Glu Arg Thr Ile Glu
                95                  100                 105

Gly Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Asn Ile
                110                 115                 120

Arg Asp Arg Phe Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu Asp
                125                 130                 135

Val Ile Pro Leu Gly Asn Asp Asn Thr Ser Tyr Arg Leu Ile Ser
                140                 145                 150

Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe
                155                 160                 165

Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile
                170                 175                 180

Leu Lys Cys Lys Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Thr
                185                 190                 195

Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro Val Val
                200                 205                 210

Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Asp Ile
                215                 220                 225

Val Ile Arg Ser Ala Asn Leu Thr Asp Asn Ala Lys Asn Ile Ile
                230                 235                 240

Val Gln Leu Asn Glu Ser Val Thr Met Asn Cys Thr Arg Pro Asn
                245                 250                 255

Asn Asn Thr Met Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe
                260                 265                 270

Tyr Ala Thr Gly Asn Ile Ile Gly Asp Ile Arg Gln Ala His Cys
                275                 280                 285

Asn Ile Ser Gly Thr Lys Trp Asn Asp Thr Leu Lys Lys Ile Ala
                290                 295                 300

Ile Lys Leu Arg Glu Gln Phe Asn Lys Thr Ile Val Phe Asn Gln
                305                 310                 315

Ser Ser Gly Gly Asp Pro Glu Ile Ala Thr Leu Ser Phe Asn Cys
                320                 325                 330

Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln Leu Phe Asn Ser
                335                 340                 345

Thr Trp Asn Ser Thr Gly Ser Asn Asn Thr Lys Gly Asn Asp Thr
                350                 355                 360

Ile Thr Leu Pro Cys Arg Ile Arg Gln Ile Ile Asn Met Trp Gln
                365                 370                 375

Lys Ile Gly Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Gln Ile
                380                 385                 390

Arg Cys Ser Ser Asn Ile Thr Gly Leu Ile Leu Thr Arg Asp Gly
                395                 400                 405

Gly Asn Asn Asn Met Ser Lys Thr Thr Glu Thr Phe Arg Pro Gly
                410                 415                 420

Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr
                425                 430                 435

Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Arg Ala
                440                 445                 450

Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly
                455                 460                 465

Ala Val Phe Leu Gly Phe Leu Gly Ala Xaa Ser Phe Xaa
                470                 475         478

(2) INFORMATION FOR SEQ ID NO: 27:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 1435 base pairs
          (B) TYPE: Nucleic Acid
          (C) STRANDEDNESS: Single
          (D) TOPOLOGY: Linear
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

CTC GAG GTA CCT GTG TGG AAA GAA GCA ACC ACC ACT 36
Leu Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr
1               5                   10

CTA TTT TGT GCA TCA GAT GCT AAA GCA TAT GAT TCA GAG 75
Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Ser Glu
        15                  20                  25

GCA CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA CCC ACA 114
Ala His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr
                30                  35

GAC CCC AAC CCA CAA GAA GTA GAA TTG GAA AAT GTG ACA 153
Asp Pro Asn Pro Gln Glu Val Glu Leu Glu Asn Val Thr
    40                  45                  50

GAA AAT TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAG 192
Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln
            55                  60

ATG CAT GGG GAT ATA ATT AGT TTA TGG GAT CAA AGC CTA 231
Met His Gly Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
65                  70                  75

AAG CCA TGT GTA AAA TTA ACC CCA CTC TGT GTT ACG TTA 270
Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
        80                  85                  90

AAT TGC ACT GAC CCA AAT GTT ACT AAT AGC GAG AGA ACG 309
Asn Cys Thr Asp Pro Asn Val Thr Asn Ser Glu Arg Thr
                95                  100

ATA GAG GGG GGA GAA ATA AAA AAT TGC TCT TTC AAT ATC 348
Ile Glu Gly Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile
    105                 110                 115

ACC ACA AAC ATA AGA GAT AGG TTT CAG AAA GAA TAT GCA 387
Thr Thr Asn Ile Arg Asp Arg Phe Gln Lys Glu Tyr Ala
            120                 125

CTT TTT TAT AAA CTT GAT GTA ATA CCA TTA GGT AAT GAT 426
Leu Phe Tyr Lys Leu Asp Val Ile Pro Leu Gly Asn Asp
130                 135                 140

AAT ACT AGC TAT AGG TTG ATA AGT TGT AAC ACC TCA GTC 465
Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val
        145                 150                 155

ATT ACA CAG GCC TGT CCA AAG GTA TCC TTT GAG CCA ATT 504
Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile
                160                 165

CCC ATA CAT TAT TGT GCC CCG GCT GGT TTT GCG ATT CTA 543
Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu
    170                 175                 180

AAG TGT AAA GAT AAG AAG TTC AAT GGA ACA GGA CCA TGT 582
Lys Cys Lys Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys
            185                 190

ACA AAT GTC AGC ACA GTA CAA TGT ACA CAT GGA ATT AAG 621
Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys
195                 200                 205

CCA GTA GTA TCA ACT CAA CTG TTG TTA AAT GGC AGT CTA 660
Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu
        210                 215                 220

GCA GAA GAA GAC ATA GTA ATT AGA TCC GCC AAT CTC ACA 699
Ala Glu Glu Asp Ile Val Ile Arg Ser Ala Asn Leu Thr
                225                 230

GAC AAT GCT AAA AAC ATA ATA GTA CAG CTG AAT GAA TCT 738
Asp Asn Ala Lys Asn Ile Ile Val Gln Leu Asn Glu Ser
    235                 240                 245

GTA ACA ATG AAT TGT ACA AGA CCC AAC AAC AAT ACA ATG 777
Val Thr Met Asn Cys Thr Arg Pro Asn Asn Asn Thr Met
            250                 255

AAA AGT ATA CAT ATA GGA CCA GGC AGA GCA TTT TAT GCA 816
Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala
260                 265                 270

ACA GGA AAC ATA ATA GGA GAT ATA AGA CAA GCA CAT TGT 855
Thr Gly Asn Ile Ile Gly Asp Ile Arg Gln Ala His Cys
        275                 280                 285

AAC ATT AGT GGA ACA AAA TGG AAT GAC ACT TTG AAA AAG 894
Asn Ile Ser Gly Thr Lys Trp Asn Asp Thr Leu Lys Lys
                290                 295

ATA GCT ATA AAA TTA AGA GAA CAA TTT AAT AAG ACA ATA 933
Ile Ala Ile Lys Leu Arg Glu Gln Phe Asn Lys Thr Ile
    300                 305                 310

GTC TTT AAT CAA TCC TCA GGA GGG GAC CCA GAA ATT GCA 972
Val Phe Asn Gln Ser Ser Gly Gly Asp Pro Glu Ile Ala
            315                 320

ACG CTC AGT TTT AAT TGT GGA GGG GAA TTT TTC TAC TGT 1011
Thr Leu Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
325                 330                 335

AAT TCA ACA CAA CTG TTT AAT AGT ACT TGG AAT AGT ACT 1050
Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Asn Ser Thr
        340                 345                 350

GGG TCA AAT AAC ACT AAA GGA AAT GAC ACA ATC ACA CTC 1089
Gly Ser Asn Asn Thr Lys Gly Asn Asp Thr Ile Thr Leu
                355                 360

CCA TGC AGA ATA AGA CAA ATT ATA AAC ATG TGG CAG AAA 1128
Pro Cys Arg Ile Arg Gln Ile Ile Asn Met Trp Gln Lys
    365                 370                 375

ATA GGA AAA GCA ATG TAT GCC CCT CCC ATC AAA GGG CAA 1167
Ile Gly Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Gln
            380                 385

ATT AGA TGT TCA TCA AAT ATT ACA GGG CTA ATA TTA ACA 1206
Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Ile Leu Thr
390                 395                 400

AGA GAT GGT GGT AAC AAC AAC ATG AGC AAG ACC ACC GAG 1245
Arg Asp Gly Gly Asn Asn Asn Met Ser Lys Thr Thr Glu
        405                 410                 415

ACC TTC AGA CCT GGA GGA GGA GAT ATG AGG GAC AAT TGG 1284
Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
                420                 425

AGA AGT GAA TTA TAT AAA TAT AAA GTA GTA AAA ATT GAA 1323
Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu
    430                 435                 440

CCA TTA GGA GTA GCA CCC ACC AGG GCA AAG AGA AGA GTG 1362
Pro Leu Gly Val Ala Pro Thr Arg Ala Lys Arg Arg Val
            445                 450

GTG CAG AGA GAA AAA AGA GCA GTG GGA ATA GGA GCT GTG 1401
Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val
455                 460                 465

TTC CTT GGG TTC TTG GGA GCA TAA AGC TTC TAG A 1435
Phe Leu Gly Phe Leu Gly Ala Xaa Ser Phe Xaa
        470                 475         478

(2) INFORMATION FOR SEQ ID NO: 28:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 478 amino acids
          (B) TYPE: Amino Acid
          (D) TOPOLOGY: Linear
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:



Leu Glu Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys
1               5                   10                  15

Ala Ser Asp Ala Lys Ala Tyr Asp Ser Glu Ala His Asn Val Trp
                20                  25                  30

Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Val
                35                  40                  45

Glu Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn
                50                  55                  60

Met Val Glu Gln Met His Gly Asp Ile Ile Ser Leu Trp Asp Gln
                65                  70                  75

Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
                80                  85                  90

Asn Cys Thr Asp Pro Asn Val Thr Asn Ser Glu Arg Thr Ile Glu
                95                  100                 105

Gly Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Asn Ile
                110                 115                 120

Arg Asp Arg Phe Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu Asp
                125                 130                 135

Val Ile Pro Leu Gly Asn Asp Asn Thr Ser Tyr Arg Leu Ile Ser
                140                 145                 150

Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe
                155                 160                 165

Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile
                170                 175                 180

Leu Lys Cys Lys Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Thr
                185                 190                 195

Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro Val Val
                200                 205                 210

Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Asp Ile
                215                 220                 225

Val Ile Arg Ser Ala Asn Leu Thr Asp Asn Ala Lys Asn Ile Ile
                230                 235                 240

Val Gln Leu Asn Glu Ser Val Thr Met Asn Cys Thr Arg Pro Asn
                245                 250                 255

Asn Asn Thr Met Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe
                260                 265                 270




Tyr Ala Thr Gly Asn Ile Ile Gly Asp Ile Arg Gln Ala His Cys
                275                 280                 285

Asn Ile Ser Gly Thr Lys Trp Asn Asp Thr Leu Lys Lys Ile Ala
                290                 295                 300

Ile Lys Leu Arg Glu Gln Phe Asn Lys Thr Ile Val Phe Asn Gln
                305                 310                 315

Ser Ser Gly Gly Asp Pro Glu Ile Ala Thr Leu Ser Phe Asn Cys
                320                 325                 330

Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln Leu Phe Asn Ser
                335                 340                 345

Thr Trp Asn Ser Thr Gly Ser Asn Asn Thr Lys Gly Asn Asp Thr
                350                 355                 360

Ile Thr Leu Pro Cys Arg Ile Arg Gln Ile Ile Asn Met Trp Gln
                365                 370                 375

Lys Ile Gly Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Gln Ile
                380                 385                 390

Arg Cys Ser Ser Asn Ile Thr Gly Leu Ile Leu Thr Arg Asp Gly
                395                 400                 405

Gly Asn Asn Asn Met Ser Lys Thr Thr Glu Thr Phe Arg Pro Gly
                410                 415                 420

Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr
                425                 430                 435

Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Arg Ala
                440                 445                 450

Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly
                455                 460                 465

Ala Val Phe Leu Gly Phe Leu Gly Ala Xaa Ser Phe Xaa
                470                 475         478

(2) INFORMATION FOR SEQ ID NO: 29:
     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 511 amino acids
          (B) TYPE: Amino Acid
          (D) TOPOLOGY: Linear
     (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

Met Arg Val Lys Gly Ile Arg Arg Asn Tyr Gln His Trp Trp Gly Arg
1               5                   10                  15

Gly Thr Met Leu Leu Gly Leu Leu Met Ile Cys Ser Ala Thr Glu Lys
            20                  25                  30

Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr
        35                  40                  45

Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Ala
    50                  55                  60

His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro
65                  70                  75                  80

Gln Glu Val Glu Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp Lys
                85                  90                  95

Asn Asn Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asn
            100                 105                 110

Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
        115                 120                 125

Asn Cys Thr Asp Leu Arg Asn Thr Thr Asn Thr Asn Asn Ser Thr Asp
    130                 135                 140

Asn Asn Asn Ser Lys Ser Glu Gly Thr Ile Lys Gly Gly Glu Met Lys
145                 150                 155                 160

Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Gly Asp Lys Met Gln Lys
                165                 170                 175

Glu Tyr Ala Leu Leu Tyr Lys Leu Asp Ile Glu Pro Ile Asp Asn Asp
            180                 185                 190

Ser Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln
        195                 200                 205

Ala Cys Pro Lys Ile Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala
    210                 215                 220

Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp Lys Lys Phe Ser Gly
225                 230                 235                 240

Lys Gly Ser Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile
                245                 250                 255

Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu
            260                 265                 270

Glu Glu Val Val Ile Arg Ser Glu Asp Phe Thr Asp Asn Ala Lys Thr
        275                 280                 285

Ile Ile Val His Leu Lys Glu Ser Val Gln Ile Asn Cys Thr Arg Pro
    290                 295                 300

Asn Tyr Asn Lys Arg Lys Arg Ile His Ile Gly Pro Gly Arg Ala Phe
305                 310                 315                 320

Tyr Thr Thr Lys Asn Ile Lys Gly Thr Ile Arg Gln Ala His Cys Ile
                325                 330                 335

Ile Ser Arg Ala Lys Trp Asn Asp Thr Leu Arg Gln Ile Val Ser Lys
            340                 345                 350

Leu Lys Glu Gln Phe Lys Asn Lys Thr Ile Val Phe Asn Pro Ser Ser
        355                 360                 365

Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu
    370                 375                 380

Phe Phe Tyr Cys Asn Thr Ser Pro Leu Phe Asn Ser Ile Trp Asn Gly
385                 390                 395                 400

Asn Asn Thr Trp Asn Asn Thr Thr Gly Ser Asn Asn Asn Ile Thr Leu
                405                 410                 415

Gln Cys Lys Ile Lys Gln Ile Ile Asn Met Trp Gln Lys Val Gly Lys
            420                 425                 430

Ala Met Tyr Ala Pro Pro Ile Glu Gly Gln Ile Arg Cys Ser Ser Asn
        435                 440                 445

Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Glu Asp Thr Asp Thr
    450                 455                 460

Asn Asp Thr Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn
465                 470                 475                 480

Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Thr Ile Glu Pro Leu
                485                 490                 495

Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu
            500                 505                 510



(2) INFORMATION FOR SEQ ID NO: 30:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2800 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:



TTCGAGCTCG CCCGACATTG ATTATTGACT AGAGTCGATC GACAGCTGTG  50

GAATGTGTGT CAGTTAGGGT GTGGAAAGTC CCCAGGCTCC CCAGCAGGCA  100

GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAG GTGTGGAAAG  150

TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA  200

GTCAGCAACC ATAGTCCCGC CCCTAACTCC GCCCATCCCG CCCCTAACTC  250

CGCCCAGTTC CGCCCATTCT CCGCCCCATG GCTGACTAAT TTTTTTTATT  300

TATGCAGAGG CCGAGGCCGC CTCGGCCTCT GAGCTATTCC AGAAGTAGTG  350

AGGAGGCTTT TTTGGAGGCC TAGGCTTTTG CAAAAAGCTA GCTTATCCGG  400

CCGGGAACGG TGCATTGGAA CGCGGATTCC CCGTGCCAAG AGTCAGGTAA  450

GTACCGCCTA TAGAGTCTAT AGGCCCACCC CCTTGGCTTC GTTAGAACGC  500

GGCTACAATT AATACATAAC CTTTTGGATC GATCCTACTG ACACTGACAT  550

CCACTTTTTC TTTTTCTCCA CAGGTGTCCA CTCCCACGTC CAACTGCACC  600

TCGGTTCGCG AAGCTAGCTT GGGCTGCATC GATTGAATTC CACTGCCTTC  650

CACCAAGCTC TGCAGGATCC CAGAGTCAGG GG TCT GTA TCT TCC TGC  697
                                    Ser Val Ser Ser Cys
                                    1               5



TGG TGG CTC CAG TTC AGG AAC AGT AAA CCC TGC TCC GAA TAT  739
Trp Trp Leu Gln Phe Arg Asn Ser Lys Pro Cys Ser Glu Tyr
                10                  15

TGC CTC TCA CAT CTC GTC AAT CTC CGC GAG GAC TGG GGA CCC  781
Cys Leu Ser His Leu Val Asn Leu Arg Glu Asp Trp Gly Pro
20                  25                  30

TGT GAC AAG CTT CAG CGC GAA CGA CCA ACT ACC CCG ATC ATC  823
Cys Asp Lys Leu Gln Arg Glu Arg Pro Thr Thr Pro Ile Ile
    35                  40                  45

AGT TAT CCT TAA GGT CTC TTT TGT GTG GTG CGT TCC GGT ATG  865
Ser Tyr Pro  *  Gly Leu Phe Cys Val Val Arg Ser Gly Met
        50                  55                  60

GGG GGG ACT GCC GCC AGG TTG GGG GCC GTG ATT TTG TTT GTG  907
Gly Gly Thr Ala Ala Arg Leu Gly Ala Val Ile Leu Phe Val
            65                  70                  75

GTC ATA GTG GGC CTC CAT GGG GTC CGC GGC AAA TAT GCC TTG  949
Val Ile Val Gly Leu His Gly Val Arg Gly Lys Tyr Ala Leu
                80                  85

GCG GAT GCC TCT CTC AAG ATG GCC GAC CCC AAT CGA TTT CGC  991
Ala Asp Ala Ser Leu Lys Met Ala Asp Pro Asn Arg Phe Arg
90                  95                  100

GGC AAA GAC CTT CCG GTC CTG GAC CAG CTG CTC GAG GTA CCT  1033
Gly Lys Asp Leu Pro Val Leu Asp Gln Leu Leu Glu Val Pro
    105                 110                 115

GTG TGG AAA GAA GCA AAC ACC ACT CTA TTT TGT GCA TCA GAT  1075
Val Trp Lys Glu Ala Asn Thr Thr Leu Phe Cys Ala Ser Asp
        120                 125                 130

GCT AAA GCA TAT AAG ACA GAG GCA CAT AAT GTT TGG GCC ACA  1117
Ala Lys Ala Tyr Lys Thr Glu Ala His Asn Val Trp Ala Thr
            135                 140                 145

CAT GCC TGT GTA CCC ACA GAC CCC AAA CCA CAA GAA ATA AAA  1159
His Ala Cys Val Pro Thr Asp Pro Lys Pro Gln Glu Ile Lys
                150                 155

TTG GAA AAT GTG ACA GAA AAT TTT AAC ATG TGG AAA AAT AAC  1201
Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn
160                 165                 170

ATG GTA GAA CAG ATG CAT GAG GAT ATA ATC AGT TTA TGG GAT  1243
Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp
    175                 180                 185

CAA AGC CTA AAG CCA TGT GTA AAA TTA ACC CCA CTC TGT GTT  1285
Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val
        190                 195                 200

ACT TTA AAT TGC ACT GAT TTG AGG AAT AAT ACT AAT ACC AAT  1327
Thr Leu Asn Cys Thr Asp Leu Arg Asn Asn Thr Asn Thr Asn
            205                 210                 215

AGT ACC TAC GGA AAA ATA ATG GAG GGA GGA GAG ATA AAA AAC  1369
Ser Thr Tyr Gly Lys Ile Met Glu Gly Gly Glu Ile Lys Asn
                220                 225

TGC TCT TTC AAT ATC ACC ACA AGC ATA AAA GAT AAG CTG AAA  1411
Cys Ser Phe Asn Ile Thr Thr Ser Ile Lys Asp Lys Leu Lys
230                 235                 240

GAT ATG TCA CTT TTT TAT AAA CTT GAT GTA GTA CCA ATA GGT  1453
Asp Met Ser Leu Phe Tyr Lys Leu Asp Val Val Pro Ile Gly
    245                 250                 255

AAT AAT AGT AAT ACT ACT AGT TAT AGG TTG ATA AGT TGT AAC  1495
Asn Asn Ser Asn Thr Thr Ser Tyr Arg Leu Ile Ser Cys Asn
        260                 265                 270

ACC TCA GTC ATT ACA CAA GCG TGT CCA AAG ACA TCC TTT GAG  1537
Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Thr Ser Phe Glu
            275                 280                 285

CCA ATT CCC ATA CAT TAT TGT GCC CCG GCT GGT TTT GCG ATT  1579
Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile
                290                 295

CTC AAG TGT AAT GAT AAT AAG TTC AAT GGA ACA GGA CCA TGT  1621
Leu Lys Cys Asn Asp Asn Lys Phe Asn Gly Thr Gly Pro Cys
300                 305                 310

CCA AAT GTC AGC ACA GTA CAA TGT ACA CAT GGA ATT AGG CCA  1663
Pro Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro
    315                 320                 325

GTA GTA TCA ACT CAA CTG CTG TTA AAT GGC AGT CTA GCA GAA  1705
Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu
        330                 335                 340

AAA GAG GTA GTC CTT AGA TCT GAA AAT TTC ACG GAC AAT GCT  1747
Lys Glu Val Val Leu Arg Ser Glu Asn Phe Thr Asp Asn Ala
            345                 350                 355

AAA ACC ATA ATA GTA CAG CTG AAC GAA TCT GTA ATA ATT GAT  1789
Lys Thr Ile Ile Val Gln Leu Asn Glu Ser Val Ile Ile Asp
                360                 365

TGT ATG AGA CCC AAC AAC AAT ACA AGA ACA AGT ATA CCT ATG  1831
Cys Met Arg Pro Asn Asn Asn Thr Arg Thr Ser Ile Pro Met
370                 375                 380

GGA CCA GGG AAA GCA TTT TAT GCA ACA GGA GAT GTA ATA GGA  1873
Gly Pro Gly Lys Ala Phe Tyr Ala Thr Gly Asp Val Ile Gly
    385                 390                 395

GAT ATA AGA CGA GCA CAT TGT AAC ATT AGT AGA GCA GGA TGG  1915
Asp Ile Arg Arg Ala His Cys Asn Ile Ser Arg Ala Gly Trp
        400                 405                 410

AAT ACC ACT TTA CAA CAG ATA GCT AAA AAA TTA AGA GAA AAA  1957
Asn Thr Thr Leu Gln Gln Ile Ala Lys Lys Leu Arg Glu Lys
            415                 420                 425

TTT GAG AAC AAA ACA ATA GTT TTT AAT CAC TCC TCA GGA GGG  1999
Phe Glu Asn Lys Thr Ile Val Phe Asn His Ser Ser Gly Gly
                430                 435

GAC CCA GAA ATT GTA ATG CAC ACT TTT AAT TGT GGA GGG GAA  2041
Asp Pro Glu Ile Val Met His Thr Phe Asn Cys Gly Gly Glu
440                 445                 450

TTT TTC TGC TGT AAT TCA ACA CCA CTG TTT AAT AGT ACT TGG  2083
Phe Phe Cys Cys Asn Ser Thr Pro Leu Phe Asn Ser Thr Trp
    455                 460                 465

AAT GAT GCA CAA CTG TTT AAT AGT ACT TGG GAT GAT ACT AAA  2125
Asn Asp Ala Gln Leu Phe Asn Ser Thr Trp Asp Asp Thr Lys
        470                 475                 480

TGG TCA AAA GGC ACT ARC GAA AAT GAC ACA ATC ACC CTC CAT  2167
Trp Ser Lys Gly Thr Asn Glu Asn Asp Thr Ile Thr Leu His
            485                 490                 495

TGC AGA ATA AAA CAA ATT ATA AAT ATG TGG CAG GAA GTA GGA  2209
Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly
                500                 505

AAA GCA ATG TAT GCC CCT CCC ATC AAA GGA CAA ATT AGA TGT  2251
Lys Ala Met Tyr Ala Pro Pro Ile Lys Gly Gln Ile Arg Cys
510                 515                 520

GAA TCA AAT ATT ACA GGG CTG CTA TTA ACA AGA GAT GGT GGT  2293
Glu Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly
    525                 530                 535

AAC GAC ACG AGC AAG AAT AAC ACT GAG ATT TTC AGA CCT GGA  2335
Asn Asp Thr Ser Lys Asn Asn Thr Glu Ile Phe Arg Pro Gly
        540                 545                 550

GGA GGA AAT ATG AAG GAC AAT TGG AGA AGT GAA TTA TAT AAA  2377
Gly Gly Asn Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys
            555                 560                 565



TAT AAA GTA ATA AAA ATT GAA CCA TTA GGA GTA GCA CCC ATC  2419
Tyr Lys Val Ile Lys Ile Glu Pro Leu Gly Val Ala Pro Ile
                570                 575             579

TAGGCAAAGA GAAGAGTGGT GCAGAGAGAA AAAAGAGCAG TGACACTACG  2469

AGCTATGTTC CTTGGGTTCT TGGGAGCAGC AGGAAGCACT ATGGGCGATA  2519

AGCTTTAATG CGGTAGTTTA TCACAGTTAA ATTCGTAACG CACTCAGGCA  2569

CCGTGTATGA AATCTAACAA TGCGACCTCC AGAAGCTTAG AACCGAGGAA  2619

CTTGTTTATT GCAGCTTATA ATGGTTACAA ATAAACCAAT ACCATCACAA  2669

ATTTCACAAA TAAAGCATTT TTTTCACTGC ATTCTAGTTG TCGTTTGTCC  2719

AAACTCATCA ATGTATCTTA TCATGTCTGG ATCGGGAATT AATTCGGCGC  2769

AGCACCATGG CCTGAAATAA CCTCTGAAAG A  2800

(2) INFORMATION FOR SEQ ID NO: 31:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 579 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

Ser Val Ser Ser Cys Trp Trp Leu Gln Phe Arg Asn Ser Lys Pro Cys
1               5                   10                  15

Ser Glu Tyr Cys Leu Ser His Leu Val Asn Leu Arg Glu Asp Trp Gly
            20                  25                  30

Pro Cys Asp Lys Leu Gln Arg Glu Arg Pro Thr Thr Pro Ile Ile Ser
        35                  40                  45

Tyr Pro  *  Gly Leu Phe Cys Val Val Arg Ser Gly Met Gly Gly Thr
    50                  55                  60

Ala Ala Arg Leu Gly Ala Val Ile Leu Phe Val Val Ile Val Gly Leu
65                  70                  75                  80

His Gly Val Arg Gly Lys Tyr Ala Leu Ala Asp Ala Ser Leu Lys Met
                85                  90                  95

Ala Asp Pro Asn Arg Phe Arg Gly Lys Asp Leu Pro Val Leu Asp Gln
            100                 105                 110

Leu Leu Glu Val Pro Val Trp Lys Glu Ala Asn Thr Thr Leu Phe Cys
        115                 120                 125

Ala Ser Asp Ala Lys Ala Tyr Lys Thr Glu Ala His Asn Val Trp Ala
    130                 135                 140

Thr His Ala Cys Val Pro Thr Asp Pro Lys Pro Gln Glu Ile Lys Leu
145                 150                 155                 160

Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu
                165                 170                 175

Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro
            180                 185                 190

Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys Thr Asp Leu
        195                 200                 205

Arg Asn Asn Thr Asn Thr Asn Ser Thr Tyr Gly Lys Ile Met Glu Gly
    210                 215                 220

Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Ser Ile Lys Asp
225                 230                 235                 240

Lys Leu Lys Asp Met Ser Leu Phe Tyr Lys Leu Asp Val Val Pro Ile
                245                 250                 255

Gly Asn Asn Ser Asn Thr Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr
            260                 265                 270

Ser Val Ile Thr Gln Ala Cys Pro Lys Thr Ser Phe Glu Pro Ile Pro
        275                 280                 285

Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp
    290                 295                 300

Asn Lys Phe Asn Gly Thr Gly Pro Cys Pro Asn Val Ser Thr Val Gln
305                 310                 315                 320

Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn
                325                 330                 335

Gly Ser Leu Ala Glu Lys Glu Val Val Leu Arg Ser Glu Asn Phe Thr
            340                 345                 350

Asp Asn Ala Lys Thr Ile Ile Val Gln Leu Asn Glu Ser Val Ile Ile
        355                 360                 365

Asp Cys Met Arg Pro Asn Asn Asn Thr Arg Thr Ser Ile Pro Met Gly
    370                 375                 380

Pro Gly Lys Ala Phe Tyr Ala Thr Gly Asp Val Ile Gly Asp Ile Arg
385                 390                 395                 400

Arg Ala His Cys Asn Ile Ser Arg Ala Gly Trp Asn Thr Thr Leu Gln
                405                 410                 415

Gln Ile Ala Lys Lys Leu Arg Glu Lys Phe Glu Asn Lys Thr Ile Val
            420                 425                 430

Phe Asn His Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Thr Phe
        435                 440                 445

Asn Cys Gly Gly Glu Phe Phe Cys Cys Asn Ser Thr Pro Leu Phe Asn
    450                 455                 460

Ser Thr Trp Asn Asp Ala Gln Leu Phe Asn Ser Thr Trp Asp Asp Thr
465                 470                 475                 480

Lys Trp Ser Lys Gly Thr Asn Glu Asn Asp Thr Ile Thr Leu His Cys
                485                 490                 495

Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met
            500                 505                 510

Tyr Ala Pro Pro Ile Lys Gly Gln Ile Arg Cys Glu Ser Asn Ile Thr
        515                 520                 525

Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Asp Thr Ser Lys Asn Asn
    530                 535                 540

Thr Glu Ile Phe Arg Pro Gly Gly Gly Asn Met Lys Asp Asn Trp Arg
545                 550                 555                 560

Ser Glu Leu Tyr Lys Tyr Lys Val Ile Lys Ile Glu Pro Leu Gly Val
                565                 570                 575

Ala Pro Ile



(2) INFORMATION FOR SEQ ID NO: 32:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1533 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

ATGGGGGGGA CTGCCGCCAG GTTGGGGGCC GTGATTTTGT TTGTCGTCAT  50

AGTGGGCCTC CATGGGGTCC GCGGCAAATA TGCCTTGGCG GATGCCTCTC  100

TCAAGATGGC CGACCCCAAT CGATTTCGCG GCAAAGACCT TCCGGTCCTG  150

GACCAGCTGC TCGAG GTA CCT GTG TGG AAA GAA GCA ACC ACC  192
                 Val Pro Val Trp Lys Glu Ala Thr Thr
                 1               5

ACT CTA TTT TGT GCA TCA GAT GCT AAA GCA TAT GAT ACA GAG  234
Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu
10                  15                  20

GTA CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA CCC ACA GAC  276
Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp
    25                  30                  35

CCC AAC CCA CAA GAA ATA GGA TTG GAA AAT GTA ACA GAA AAT  318
Pro Asn Pro Gln Glu Ile Gly Leu Glu Asn Val Thr Glu Asn
        40                  45                  50

TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAG ATG CAT GAG  360
Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His Glu
            55                  60                  65

GAT ATA ATC AGT TTA TGG GAT CAA AGC TTA AAG CCA TGT GTA  402
Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val
                70                  75

AAA TTA ACC CCA CTA TGT GTT ACT TTA AAT TGC ACT GAT TTG  444
Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys Thr Asp Leu
80                  85                  90

AAA AAT GCT ACT AAT ACC ACT AGT AGC AGC TGG GGA AAG ATG  486
Lys Asn Ala Thr Asn Thr Thr Ser Ser Ser Trp Gly Lys Met
    95                  100                 105

GAG AGA GGA GAR ATA AAA AAC TGC TCT TTC AAT GTC ACC ACA  528
Glu Arg Gly Glu Ile Lys Asn Cys Ser Phe Asn Val Thr Thr
        110                 115                 120

AGT ATA AGA GAT AAG ATG AAG AAT GAA TAT GCA CTT TTT TAT  570
Ser Ile Arg Asp Lys Met Lys Asn Glu Tyr Ala Leu Phe Tyr
            125                 130                 135

AAA CTT GAT GTA GTA CCA ATA GAT AAT GAT AAT ACT AGC TAT  612
Lys Leu Asp Val Val Pro Ile Asp Asn Asp Asn Thr Ser Tyr
                140                 145

AGG TTG ATA AGT TGT AAC ACC TCA GTC ATT ACA CAG GCC TGT  654
Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys
150                 155                 160

CCA AAG GTG TCC TTT GAG CCA ATT CCC ATA CAT TAT TGT GCC  696
Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala
    165                 170                 175

CCG GCT GGT TTT GCG ATT CTA AAG TGT AGA GAT AAA AAG TTC  738
Pro Ala Gly Phe Ala Ile Leu Lys Cys Arg Asp Lys Lys Phe
        180                 185                 190

AAC GGA ACA GGA CCA TGT ACA AAT GTC AGC ACA GTA CAA TGT  780
Asn Gly Thr Gly Pro Cys Thr Asn Val Ser Thr Val Gln Cys
            195                 200                 205

ACA CAT GGA ATT AGG CCA GTA GTA TCA ACT CAA CTG CTG TTA  822
Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu
                210                 215

AAT GGC AGT TTA GCA GAA GAA GAA GTA GTA ATT AGA TCT GCC  864
Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Ala
220                 225                 230

AAT TTC TCG GAC AAT GCT AAA ACC ATA ATA GTA CAG CTG AAC  906
Asn Phe Ser Asp Asn Ala Lys Thr Ile Ile Val Gln Leu Asn
    235                 240                 245

GAA TCT GTA GAA ATT AAT TGT ACA AGA CCC AAC AAC AAT ACA  948
Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr
        250                 255                 260

AGA AGA AGT ATA CAT ATA GGA CCA GGG AGA GCA TTT TAT GCA  990
Arg Arg Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Ala
            265                 270                 275

ACA GGA GAA ATA ATA GGA GAC ATA AGA CAA GCA CAT TGT AAC  1032
Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn
                280                 285

CTT AGT AGC ACA AAA TGG AAT AAT ACT TTA AAA CAG ATA GTT  1074
Leu Ser Ser Thr Lys Trp Asn Asn Thr Leu Lys Gln Ile Val
290                 295                 300

ACA AAA TTA AGA GAA CAT TTT AAT AAA ACA ATA GTC TTT AAT  1116
Thr Lys Leu Arg Glu His Phe Asn Lys Thr Ile Val Phe Asn
    305                 310                 315

CAC TCC TCA GGA GGG GAC CCA GAA ATT GTA ATG CAC AGT TTT  1158
His Ser Ser Gly Gly Asp Pro Glu Ile Val Met His Ser Phe
        320                 325                 330

AAT TGT GGA GGG GAA TTT TTC TAC TGT AAT ACA ACA CCA CTG  1200
Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Thr Pro Leu
            335                 340                 345

TTT AAT AGT ACT TGG AAT TAT ACT TAT ACT TGG AAT AAT ACT  1242
Phe Asn Ser Thr Trp Asn Tyr Thr Tyr Thr Trp Asn Asn Thr
                350                 355

GAA GGG TCA AAT GAC ACT GGA AGA AAT ATC ACA CTC CAA TGC  1284
Glu Gly Ser Asn Asp Thr Gly Arg Asn Ile Thr Leu Gln Cys
360                 365                 370

AGA ATA AAA CAA ATT ATA AAC ATG TGG CAG GAA GTA GGA AAA  1326
Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys
    375                 380                 385

GCA ATG TAT GCC CCT CCC ATA AGA GGA CAA ATT AGA TGC TCA  1368
Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile Arg Cys Ser
        390                 395                 400

TCA AAT ATT ACA GGG CTG CTA TTA ACA AGA GAT GGT GGT AAT  1410
Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn
            405                 410                 415

AAC AGC GAA ACC GAG ATC TTC AGA CCT GGA GGA GGA GAT ATG  1452
Asn Ser Glu Thr Glu Ile Phe Arg Pro Gly Gly Gly Asp Met
                420                 425

AGG GAC AAT TGG AGA AGT GAA TTA TAT AAA TAT AAA GTA GTA  1494
Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val
430                 435                 440

AAA ATT GAA CCA TTA GGA GTA GCA CCC ACC AAG GCA TAA  1533
Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Lys Ala  *
    445                 450                 455



WO 98/01564 PCT/US97/09690



(2) INFORMATION FOR SEQ ID NO: 33:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 456 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

Val Pro Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp
1               5                   10                  15

Ala Lys Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala
            20                  25                  30

Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Ile Gly Leu Glu Asn Val
        35                  40                  45

Thr Glu Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu Gln Met His
    50                  55                  60

Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys
65                  70                  75                  80

Leu Thr Pro Leu Cys Val Thr Leu Asn Cys Thr Asp Leu Lys Asn Ala
                85                  90                  95

Thr Asn Thr Thr Ser Ser Ser Trp Gly Lys Met Glu Arg Gly Glu Ile
            100                 105                 110

Lys Asn Cys Ser Phe Asn Val Thr Thr Ser Ile Arg Asp Lys Met Lys
        115                 120                 125

Asn Glu Tyr Ala Leu Phe Tyr Lys Leu Asp Val Val Pro Ile Asp Asn
    130                 135                 140

Asp Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr
145                 150                 155                 160

Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys
                165                 170                 175

Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Arg Asp Lys Lys Phe Asn
            180                 185                 190

Gly Thr Gly Pro Cys Thr Asn Val Ser Thr Val Gln Cys Thr His Gly
        195                 200                 205

Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala
    210                 215                 220

Glu Glu Glu Val Val Ile Arg Ser Ala Asn Phe Ser Asp Asn Ala Lys
225                 230                 235                 240

Thr Ile Ile Val Gln Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg
                245                 250                 255

Pro Asn Asn Asn Thr Arg Arg Ser Ile His Ile Gly Pro Gly Arg Ala
            260                 265                 270

Phe Tyr Ala Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys
        275                 280                 285

Asn Leu Ser Ser Thr Lys Trp Asn Asn Thr Leu Lys Gln Ile Val Thr
    290                 295                 300

Lys Leu Arg Glu His Phe Asn Lys Thr Ile Val Phe Asn His Ser Ser
305                 310                 315                 320

Gly Gly Asp Pro Glu Ile Val Met His Ser Phe Asn Cys Gly Gly Glu
                325                 330                 335

Phe Phe Tyr Cys Asn Thr Thr Pro Leu Phe Asn Ser Thr Trp Asn Tyr
            340                 345                 350

Thr Tyr Thr Trp Asn Asn Thr Glu Gly Ser Asn Asp Thr Gly Arg Asn
        355                 360                 365

Ile Thr Leu Gln Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu
    370                 375                 380

Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile Arg Cys
385                 390                 395                 400

Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Asn
                405                 410                 415

Ser Glu Thr Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn
            420                 425                 430

Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu
        435                 440                 445

Gly Val Ala Pro Thr Lys Ala  *
    450                 455



(2) INFORMATION FOR SEQ ID NO: 34:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 37 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:

GGGAATTCGG ATCCAGAGGA GAAGACAGTG CCAATGA  37

(2) INFORMATION FOR SEQ ID NO: 35:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

CTCGAGCTCC TGAAGACAGT CAGACTCATC AAG  33

(2) INFORMATION FOR SEQ ID NO: 36:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:

GGTCTAGAAG CTTTAGCCCA TAGTGCTTCC TGCTGCTCC  39

(2) INFORMATION FOR SEQ ID NO: 37:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 36 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

GGGCGGATCC TCGAGGTACC TGTRTGGAAA GAAGCA  36

(2) INFORMATION FOR SEQ ID NO: 38:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 38 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:

GGTCTAGAAG CTTTATGCTC CYAAGAACCC AAGGAACA  38



(2) INFORMATION FOR SEQ ID NO: 39:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: Amino Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:

Ile Gly Pro Gly Arg Ala Phe
1               5

(2) INFORMATION FOR SEQ ID NO: 40:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: Amino Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

Ile Gly Pro Gly Arg Ala Trp

(2) INFORMATION FOR SEQ ID NO: 41:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: Amino Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:

Leu Gly Pro Gly Ser Thr Phe
1               5

(2) INFORMATION FOR SEQ ID NO: 42:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: Amino Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:

Ile Gly Pro Gly Arg Val Leu
1               5

(2) INFORMATION FOR SEQ ID NO: 43:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: Amino Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43:

Ile Gly Pro Gly Ser Ala Phe
1               5

(2) INFORMATION FOR SEQ ID NO: 44:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 5 amino acids
(B) TYPE: Amino Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:

Ile Gly Pro Gly Arg
1               5






WHAT IS CLAIMED IS:

1. An isolated polypeptide comprising an HIV gp120
amino acid sequence selected from the group
consisting of Sequence ID Nos. 2, 4, 6, 8, 10, 12,
14, 16, 18, 20, 22, 24, 26, and 28, and fragments
thereof.

2. The polypeptide of Claim 1 wherein the polypeptide
additionally comprises a flag epitope sequence.

3. The polypeptide of Claim 2 wherein the flag
epitope sequence is HSV gD-1 flag epitope
sequence.

4. The polypeptide of Claim 2 wherein the flag
epitope sequence is fused to the HIV gp120 amino
acid sequence.

5. An oligonucleotide of not more than five kilobases
encoding an HIV gp120 polypeptide sequence
comprising an amino acid sequence selected from
the group consisting of Sequence ID Nos. 2, 4, 6,
8, 10, 12, 14, 16, 18, 20, 22, 24, 26, and 28, and
fragments thereof.

6. The oligonucleotide of Claim 5 wherein the
oligonucleotide includes a nucleotide sequence
selected from the group consisting of Sequence ID
Nos. 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25,
and 27, and fragments thereof.

7. The oligonucleotide of Claim 5 wherein the amino
acid sequence encoded by the oligonucleotide
additionally comprises a flag epitope.

8. The oligonucleotide of Claim 7 wherein the flag
epitope is HSV gD-1 flag epitope.

9. The oligonucleotide of Claim 7 wherein the flag
epitope is fused to the HIV gp120 amino acid
sequence.

10. A vaccine comprising gp120 MN and an HIV gp120
polypeptide sequence selected from the group
consisting of Sequence ID Nos. 2, 4, 6, 8, 10, 12,
14, 16, 18, 20, 22, 24, 26, and 28, and fragments
thereof, in a suitable carrier.

11. A vaccine comprising:
a. a first gp120 polypeptide sequence or a
fragment thereof; and
b. a breakthrough isolate HIV gp120 polypeptide
sequence or a fragment thereof from a
vaccinee vaccinated with said first HIV gp120
polypeptide sequence;
wherein said HIV gp120 polypeptide sequences are
in a suitable carrier.

12. The vaccine of Claim 11 wherein said first HIV
gp120 polypeptide sequence comprises gp120 MN,
gp120 A244, gp120 MN-GNE6 (Sequence ID No. 31), or
gp120 MN-GNE8 (Sequence ID No. 33).

13. The vaccine of Claim 12 wherein said vaccine
additionally comprises a second gp120 polypeptide
sequence comprising gp120 MN, gp120 A244, gp120
MN-GNE6 (Sequence ID No. 31), or gp120 MN-GNE8
(Sequence ID No. 33), or a fragment thereof,
wherein said second HIV gp120 polypeptide sequence
is different from said first HIV gp120 polypeptide
sequence.

14. The vaccine of Claim 13 wherein said first gp120
polypeptide sequence comprises gp120 MN and said
second gp120 polypeptide sequence comprises gp120
A244.

15. The vaccine of Claim 14 wherein said breakthrough
isolate comprises an HIV gp120 polypeptide
sequence selected from the group consisting of
Sequence ID Nos. 2, 4, 6, 8, 10, 12, 14, 16, 18,
20, 22, 24, 26, and 28, and fragments thereof in a
suitable carrier.

16. The vaccine of Claim 13 wherein said first gp120
polypeptide sequence comprises gp120 MN and said
second gp120 polypeptide sequence comprises gp120
MN-GNE8 (Sequence ID No. 33).

17. The vaccine of Claim 16 wherein said breakthrough
isolate HIV gp120 polypeptide sequence is an HIV
gp120 polypeptide sequence selected from the group
consisting of Sequence ID Nos. 2, 4, 6, 8, 10, 12,
14, 16, 18, 20, 22, 24, 26, and 28, and fragments
thereof, in a suitable carrier.

18. The vaccine of Claim 13 wherein said breakthrough
isolate HIV gp120 polypeptide is from a vaccinee
vaccinated with said first and second HIV gp120
polypeptide sequences.

19. A method for making an HIV vaccine comprising
adding an HIV gp120 polypeptide sequence or
fragments thereof from a breakthrough isolate from
a vaccinee to the vaccine with which the vaccinee
was vaccinated.

20. The vaccine of Claim 11 wherein said first gp120
polypeptide sequence is from a macrophage-tropic
HIV-1 strain.

21. The vaccine of Claim 11 wherein said first gp120
polypeptide sequence is from a T-cell-tropic HIV-1
strain.

22. The vaccine of Claim 21 wherein said vaccine
additionally comprises a second gp120 polypeptide
sequence or a fragment from a macrophage-tropic
HIV-1 strain.

23. The vaccine of Claim 22 wherein said first and
second gp120 polypeptide sequences bind to
different chemokine receptors.

24. The vaccine of Claim 23 wherein said first gp120
polypeptide sequence binds to CC-CKR-5 and said
second gp120 polypeptide sequence binds to CXC-
CKR-4.

25. The vaccine of Claim 11 wherein said vaccine
additionally comprises a virus engineered to
induce a cytotoxic T-cell response.
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